PREFACE


SPONSORSHIP OF THE WORK 

This volume was prepared by the Statistical Research Group, Division
of War Research, Columbia University, for the Applied Mathematics
Panel, National Defense Research Committee, Office of Scientific 
Research and Development. The work was initiated under contract 
OEMsr-618 and completed under contract OEMsr-1131, both between 
the Trustees of Columbia University and the Office of Scientific Research 
and Development. Royalties, at usual rates, are being paid by the 
publisher to the United States government; no royalties are being paid to 
Columbia University or to any private institution or individual. The
publisher has agreed that ten years after the date on which this volume is 
issued the copyright thereon shall be relinquished and the work shall 
become part of the public domain.

The Statistical Research Group was organized and its initial staff 
selected by Warren Weaver, Chief of the Applied Mathematics 
Panel, on July 1, 1942. The staff was drawn from universities and 
research organizations throughout the country, many of which generously 
granted leaves for the work. The group disbanded September 30, 1945, 
except for a few persons who have prepared this and another volume as 
monographs in connection with the Applied Mathematics Panel's Summary
Technical Report.


NATURE OF THE BOOK 

This book discusses a series of problems that occur frequently in 
planning, analyzing, or interpreting quantitative data. Various techniques
appropriate to these problems are explained, in terms of both
general principles and specific procedures. 

The book is written for scientists, engineers, managers, and statisticians.
It seeks to explain the statistical approach to various problems,
to expound the theory that can be brought to bear on each problem, to 
point out the features of the problem that determine the relevance of the 
theory, to give detailed and workable computational procedures, including
formulas, tables, and charts, and to illustrate the interpretation of
the results. Concrete illustrative material is used to bring out the 
interplay of substantive and statistical considerations, the illustrations 
being selected primarily for the light they throw on the art of applying 
statistical techniques and only secondarily for intrinsic interest. Although 
relatively little new statistical theory is included, several topics are given 
more thoroughly integrated and carefully expounded treatment than has 
previously been available, and many new tables are included. 

In preparing this book, the authors have kept in mind the reader who 
is interested in applying statistical techniques in his work but who either 
does not have much interest in or time for studying general statistical 
theory as such, or else feels that he can best understand and appreciate 
statistical theory through a series of special problems. Even the statistical
theorist, however, should be able to deepen and broaden his understanding
and appreciation of statistical theory by examining special
problems in some detail.

The chemical, physical, and engineering sciences are, on the whole, 
distinctly backward in the statistical planning of experiments and the 
analysis of empirical data, at least by comparison with the social, biological,
and agricultural sciences. No doubt this is partly because physical
scientists have attained a high degree of proficiency in experimental 
techniques and can make repeated, independent experimental investigations
of a given phenomenon (they can “think with their hands”), so
that competent investigators are likely to learn the truth (though perhaps 
belatedly and at excessive cost) despite crudities in statistical techniques. 
In the social and biological sciences, on the other hand, repetition or 
control of experimental procedures is usually impossible, at least in anything
like the degree achieved in the physical sciences, so that the truth
can be learned only through highly developed statistical techniques 
closely coupled (as statistical techniques must always be) with sound 
scientific knowledge and techniques. But even though sound statistical 
techniques may not be indispensable in the physical sciences, they are 
invaluable. Increasing recognition of this, accelerated by developments 
during the war, is leading to the rapid introduction of statistical methods 
into chemistry, physics, and engineering; indeed, it seems likely that in 
the immediate future the greatest contributions of statistics will be to, 
and its greatest developments will arise from, the physical sciences. 
(The physical sciences may, however, yield this distinction to medicine, 
which is said to be nearly as dependent on statistics as the social sciences 
and nearly as backward statistically as the physical sciences.) 

BACKGROUND 

During the war the Statistical Research Group carried on three kinds 
of work: (1) It advised and assisted the Army, the Navy, and the Office 
of Scientific Research and Development on the statistical aspects of 
problems arising in their activities; this constituted the largest part of 
the work. (2) It assumed primary responsibility for the investigation of 
certain problems of a predominantly statistical or probabilistic character. 
(3) It developed, reviewed, or expounded statistical techniques; this was 
the smallest part of the work and was incidental to the first two. 


The great bulk of the work fell into the first two categories, and the 
final products did not emphasize statistical techniques as such. From 
this work, however, there arose a number of developments in applied 
statistics, as well as some in theoretical statistics. This book has been
prepared to preserve and to disseminate useful statistical material that 
was almost a by-product of the responsible and cooperative handling of 
practical military, scientific, engineering, and production problems. 

AUTHORSHIP 

Because it would be difficult or impossible accurately to assign personal
credit and responsibility for this volume, its authorship is attributed
to the Statistical Research Group as a whole. Under ordinary circumstances
the basic material of the various chapters would have been
published in technical journals by the originators individually. Under 
the actual circumstances, many ideas are presented here for the first 
time in chapters prepared by others than those who did the basic work. 
It thus becomes important to assign credit for the ideas in such chapters 
without at the same time implying responsibility for the entire content 
or emphasis. An attempt has been made to do this by notes at the
beginning of the chapters. The editors, and in particular the undersigned,
however, have exercised final judgment and authority over the
form and content of each chapter and are thus in the unhappy position 
of having to assume all responsibility for the shortcomings of the book 
without being able to claim credit for its merits. 

The following were members of the senior scientific staff of the Statistical 
Research Group, and the influence of each is reflected in the book. 

Kenneth J. Arnold, University of Wisconsin 
CHAPTER 1


USE OF VARIABLES IN ACCEPTANCE INSPECTION 
FOR PERCENT DEFECTIVE 

1. ATTRIBUTES AND VARIABLES 

Acceptance inspection is the process of judging whether a lot of items 
is of acceptable quality by examining a sample from the lot. Such inspection
is widely and successfully used in industry. It is usually carried out
by classifying each item in the sample as either defective or nondefective,
then deciding from the number of defective items in the sample whether
the percentage of defective items in the lot is small enough for the lot to 
be considered acceptable. Thus, the quality of the lot is measured by the 
percentage of defective items, and this is judged by classifying the sample 
items according to attributes. 

Often the sample items are classified as defective or nondefective 
on the basis of a measurable quality characteristic. For example, the 
items may be classified as too heavy or not, although the actual weight 
could be measured; they may be classified as too short or not, although the 
length could be measured; they may be classified as having an excessive 
volatile content or not, although the volatile content could be measured; 
they may be classified as operating too fast or not, although the speed 
could be measured; they may be classified as having too short a useful life 
or not, although the useful life could be measured. Such cases, where 
inspection is based on attributes, may be contrasted with cases in which 
inspection is based on variables, that is, on the actual measured values of 
the characteristic. 

Inspection by variables has one major advantage over inspection by 
attributes: fewer items need be inspected for a given degree of accuracy 
in judging a lot’s acceptability; or, conversely, decisions about the lot 
are more reliable if based on a sample inspected by variables than if 
based on a sample of the same size inspected by attributes. The reason 
for this is that variables inspection provides more information about 
the quality of each item; for example, that the volatile content is 16.02 
percent or 20.61 percent, as the case may be, instead of merely that the 
volatile content exceeds, say, 16 percent in both cases. Since variables 
inspection provides more information about each item, a smaller number of 
sample items gives the same amount of information about the lot, or the 
same number of sample items gives more information about the lot. 

The principal advantage of inspection by attributes over inspection
by variables is that it requires less skill, less time, or less expensive equipment
to inspect each item; less record-keeping to record the results of
inspection; less arithmetic to determine from the sample results whether 
the lot should be judged acceptable. Usually this advantage of simplicity 
more than offsets the fact that more items must be inspected under 
attribute than under variables inspection. 

Another advantage of attribute inspection is that the selection and 
installation of sound sampling plans is easier. In fact, complete and 
detailed procedures are available in such form that little or no previous 
knowledge of technical statistics is required for their successful use.* In 
attribute inspection the mathematical theory about the manner in which 
quality varies from one item to another is always fulfilled if the sample is 
selected at random from the lot. The mathematical theory underlying
variables inspection plans, on the other hand, is that quality varies among 
the items of the lot in accordance with a normal (Gaussian) distribution; 
whether this theory is a sufficiently good approximation to the facts in a 
particular inspection problem is a matter requiring investigation by 
statistical methods. 

There are other less important reasons for preferring attributes to 
variables. One of these is a misapprehension as to the amount of arithmetic
required, often based in large part on ignorance of short-cut
computational procedures. Another such reason is exaggeration of the 
limitations imposed by the normality assumption in the mathematics 
underlying variables inspection. 

Attributes are sometimes used instead of variables because well-known 
sampling plans for variables measure lot quality by the average or the 
variability of the measurements, while well-known attribute plans measure
it by the percent defective. Percent defective is often the appropriate
measure, and its appropriateness is not affected by shifting between 
attributes and variables. For example, in inspecting rods for length, the 
percent too short may in some circumstances be a better measure of 
quality than either the mean or the variability of length—as when 
rods below a certain length are useless while those above that length can 
easily be trimmed at virtually no extra cost. Similarly, in determining 
whether individual packages in a lot provide full measure, the percent 
below the specified measure may be more significant than the average 
measure. Variables plans for percent defective exist but are not widely
known.†

* See, for example, Statistical Research Group, Sampling Inspection; Dodge and
Romig, Sampling Inspection Tables (for full citations see Sec. 13).

† See especially the following two papers, of which this chapter is primarily a
development: Jennett and Welch, “The Control of Proportion Defective”; Johnson
and Welch, “Applications of the Non-central t Distribution.”
Although in most cases attribute plans are undoubtedly more economical
than variables plans for percent defective, because the lower cost of
inspecting each item usually more than offsets the increase in the number 
of items that must be inspected, variables inspection offers decided 
advantages in far more instances than has usually been recognized. This 
is especially true when the inspection process is costly, as for example 
when it requires expensive time, equipment, or personnel, or when the 
items are expensive and are damaged or destroyed by inspection. It is 
also true when measurement by variables costs only a little more than 
classification by attributes, or when for purposes other than acceptance 
inspection the data on variables are more valuable than those on attributes.
Thus it is desirable that the practical aspects of variables plans be
more fully developed and the plans made more generally available than 
has heretofore been the case. 

2. BACKGROUND OF THE PLAN 

A variables plan of acceptance inspection for percent defective is based 
upon the sample mean and the sample standard deviation. Other measures
of central tendency and variability could be used, for example, the
median and the range. No other measures are considered in this chapter, 
however, because other measures all require larger samples to give as good 
results as the mean and standard deviation. The gains in computational 
simplicity that may be afforded by other measures are likely to be unimportant
in the situations for which variables inspection is appropriate.
Although in routine inspection operations it is not necessary to compute 
either the mean or the standard deviation, it will clarify the ideas to begin 
the discussion in terms of them. 

A normal distribution, such as that shown in Fig. 1.1, is characterized
by two quantities (parameters), its mean $\mu$ and its standard deviation $\sigma$.
Normal distributions have large or small means according as $\mu$ is large or
small; the observations are tightly clustered or widely scattered about the
mean according as $\sigma$ is small or large. The proportion of the observations
exceeding a certain upper limit, say U, depends upon how far the limit is
above the mean, measured in terms of the standard deviation. For
example, if the limit exceeds the mean by twice the standard deviation,
that is, if $U = \mu + 2\sigma$, 2.275 percent of the observations exceed the limit;
but if the limit exceeds the mean by three times the standard deviation,
that is, if $U = \mu + 3\sigma$, only 0.135 percent of the observations exceed the
limit. To determine the percent of observations exceeding any limit, first
subtract the mean from the limit and then divide the difference by the
standard deviation; that is, compute

$$K = \frac{U - \mu}{\sigma} \qquad (1)$$

which shows the difference between the limit and the mean as a multiple of
the standard deviation. Various tables show for different values of K
what part of a normal distribution exceeds K.*
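
Where a table is not at hand, the same upper-tail proportion can be computed directly. The following is a minimal sketch, assuming Python's standard `statistics.NormalDist`; the function name `fraction_exceeding` is illustrative and not part of the original text. It applies formula (1) and then looks up the area beyond K.

```python
from statistics import NormalDist

def fraction_exceeding(limit, mean, sd):
    """Fraction of a normal distribution lying above `limit`."""
    k_value = (limit - mean) / sd            # formula (1): K = (U - mu) / sigma
    return 1.0 - NormalDist().cdf(k_value)   # upper-tail area beyond K

# Limits two and three standard deviations above the mean.
print(round(100 * fraction_exceeding(2.0, 0.0, 1.0), 3))  # percent beyond mu + 2 sigma
print(round(100 * fraction_exceeding(3.0, 0.0, 1.0), 3))  # percent beyond mu + 3 sigma
```

The two printed values should agree with the 2.275 and 0.135 percent quoted above.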



Fig. 1.1.—Normal distribution. 


From formula (1) it is clear that the proportion exceeding a given 
limit U, which we will call the fraction defective, depends upon both the
mean and the standard deviation. For a given standard deviation, the 
fraction defective is smaller the smaller the mean; but, however small 
the mean, the standard deviation may be large enough to cause a high 

* See, for example, Table 1.1 in this chapter. By far the most extensive table is 
that prepared by the New York Mathematical Tables Project, Tables of Probability 
Functions. This gives probabilities to 15 decimal places for values of K (denoted by 
x in the tables) from 0 to 1 by steps of 0.0001, and for all values of K above 1 by steps 
of 0.001. 

Some care is required in using tables of the normal probability function, since 
two different kinds of table are common and several methods of tabulation are used. 
The two kinds of table differ in the units used to measure the difference between the 
limit and the mean, the most common type (illustrated by Vol. II of the NYMTP 
tables) measuring this as a multiple of the standard deviation and the other type 
(illustrated by Vol. I of the NYMTP tables) measuring it as a multiple of $\sqrt{2}$ times
the standard deviation. Thus, to enter the second type of table, K computed from
formula (1) must be divided by $\sqrt{2} = 1.414$. In either type of table the entries may
represent (a) the proportion exceeding K, which is the same as the proportion below
$-K$; (b) the proportion below K, which is the same as the proportion above $-K$;
(c) the proportion between the mean and K, which is the same as the proportion
between the mean and $-K$; (d) the proportion between $-K$ and K; (e) the proportion
below $-K$ and above $+K$. Given any one of these five values, the others are easily
computed from the facts that the distribution is symmetrical about the mean and 
that the total proportion is one. 
fraction defective. For a fixed mean less than the limit, the fraction
defective is smaller the smaller the standard deviation; but, however 
small the standard deviation, the mean may be large enough to cause a 
high fraction defective. The acceptability of any mean, when quality 
is measured by fraction defective, depends on the standard deviation; and 
the acceptability of any standard deviation depends on the mean. 

Fig. 1.2.—Combinations of $\mu$ and $\sigma$ corresponding with the same percent defective.

Figure 1.2 illustrates various combinations of the mean and standard
deviation that correspond to the same lot quality, as measured by fraction
defective. Any combination corresponding to a point on the sloping
line represents the same quality; combinations to the left of the line
represent better quality (lower fraction defective), those to the right worse
quality. Any lot quality can be represented by such a line, the percent
defective being greater the steeper the line.* Thus a lot is acceptable,
in the sense that it has a fraction defective less than or equal
to a specified acceptable quality level (AQL), whenever $(U - \mu)/\sigma$ is
greater than or equal to a certain value of K which depends on the AQL
and can be determined from tables of the normal probability function.
In other words, the lot is acceptable if its mean plus a certain multiple of
its standard deviation does not exceed the limit; that is, if

$$\mu + K\sigma \le U \qquad (2)$$
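
Criterion (2) can be illustrated with a short sketch. It assumes Python's standard `statistics.NormalDist` in place of printed tables; the function names, the limit of 16, the AQL of 1 percent, and the lot parameters are hypothetical values chosen only for illustration.

```python
from statistics import NormalDist

def k_for_aql(aql):
    """Normal deviate K exceeded with probability `aql` (0 < aql < 1)."""
    return NormalDist().inv_cdf(1.0 - aql)

def lot_is_acceptable(mean, sd, upper_limit, aql):
    """Criterion (2): mean + K * sd must not exceed the upper limit."""
    return mean + k_for_aql(aql) * sd <= upper_limit

# Hypothetical lots: limit U = 16, AQL = 1 percent, same mean, different spread.
print(round(k_for_aql(0.01), 4))
print(lot_is_acceptable(mean=15.0, sd=0.40, upper_limit=16.0, aql=0.01))
print(lot_is_acceptable(mean=15.0, sd=0.50, upper_limit=16.0, aql=0.01))
```

The two lots share a mean, so the example shows the point made in the text: whether a given mean is acceptable depends on the standard deviation.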

* Lot qualities worse than 50 percent defective would be represented by a line
starting at $\mu = U$, $\sigma = 0$ and sloping upward to the right. When the mean exceeds
the limit the percent defective exceeds 50, and for a given mean the percent defective
is smaller the larger the standard deviation. Such cases are rarely of interest in
acceptance sampling but can be treated by formulas (1'), (2'), and (3') below.
The discussion so far has been entirely in terms of the lot mean and
standard deviation. In practice, the lot mean and standard deviation,
$\mu$ and $\sigma$, are unknown, and it is necessary to judge from the sample mean
and standard deviation, $\bar{x}$ and s, whether they satisfy criterion (2). This
can be done by adding a multiple of the sample standard deviation to the
sample mean and comparing the sum with the limit. In choosing the
multiple of the sample standard deviation, however, allowance must be
made for the fact that the mean and standard deviation determined from
the sample are subject to fluctuations from sample to sample. That is, if
a number of samples are drawn from lots having exactly the same fraction
defective, the samples will not all yield the same value of $(U - \bar{x})/s$.
Thus the acceptable level of $(U - \bar{x})/s$, say k, must be small enough so
that there will not be too high a probability that an acceptable lot will
fail to meet the acceptance criterion for the sample, namely,

$$\bar{x} + ks \le U \qquad (3)$$

If the sample contains measurements on N items (that is, contains
N observations) represented by $x_1, x_2, \ldots, x_N$, the sample mean $\bar{x}$ is

$$\bar{x} = \frac{x_1 + x_2 + \cdots + x_N}{N} = \frac{\sum x}{N} \qquad (4)$$

the sample variance $s^2$ is

$$s^2 = \frac{(x_1 - \bar{x})^2 + (x_2 - \bar{x})^2 + \cdots + (x_N - \bar{x})^2}{N - 1} = \frac{\sum (x - \bar{x})^2}{N - 1} \qquad (5)$$

and the sample standard deviation s is the square root of the sample
variance.

A few miscellaneous remarks will complete the background of variables
inspection plans for percent defective:

The symbols $\mu$ and $\sigma$ are used to indicate the population mean and
standard deviation; $\bar{x}$ and s are used to represent the sample mean and
standard deviation. Similarly, K is used to represent the deviation
of the lot mean from the limit in units of the lot standard deviation, and
k is used to represent the deviation of the sample mean from the limit in
units of the sample standard deviation. This is in conformity with
the sound and increasingly prevalent practice of using notation that
emphasizes the distinction between true lot (population or universe)
parameters and sample estimates of these parameters, an estimate meaning
in statistical usage a value calculated from a sample (not guessed or
judged in the everyday sense of “estimate”) according to established
principles for estimating lot parameters from sample data. Neglect of
the distinction between lot parameters and sample estimates of them
can be (and often is) a cause of serious confusion. The quantities $\bar{x}$ and
s, which for brevity have been called the sample mean and standard
deviation, should properly be called the sample estimates of the lot mean
and standard deviation.

The use of $N - 1$ instead of N in the denominator of the variance
[see formula (5)] has no real influence on the sampling plan provided the k
of expression (3) is chosen with due regard to whether N or $N - 1$ is used.
If N is used, the proper value of k is greater by a factor $\sqrt{N/(N - 1)}$
than if $N - 1$ is used, so k times s is the same in either case. If, however,
the value of s is itself of interest, the use of $N - 1$ is preferable.
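
The equivalence of the two conventions can be checked numerically. The sketch below assumes Python's standard `statistics.stdev` (divisor N - 1) and `statistics.pstdev` (divisor N); the measurements and the multiplier are hypothetical.

```python
import math
from statistics import mean, pstdev, stdev

measurements = [15.2, 15.9, 14.8, 15.5, 15.1, 15.7]  # hypothetical data
n = len(measurements)
k = 1.9                                   # hypothetical multiplier for the N - 1 form

s_n_minus_1 = stdev(measurements)         # divisor N - 1, as in formula (5)
s_n = pstdev(measurements)                # divisor N
k_for_n = k * math.sqrt(n / (n - 1))      # multiplier enlarged by sqrt(N / (N - 1))

stat_a = mean(measurements) + k * s_n_minus_1
stat_b = mean(measurements) + k_for_n * s_n
print(math.isclose(stat_a, stat_b))
```

Because the larger multiplier exactly offsets the smaller standard deviation, the quantity compared with the limit is unchanged.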

Although formula (5) is useful for bringing out the meaning of the
standard deviation, it should not be used for computing purposes. For
computing, use one of the following two forms:

$$s^2 = \frac{\sum x^2 - \dfrac{(\sum x)^2}{N}}{N - 1} \qquad (6a)$$

$$s^2 = \frac{N \sum x^2 - (\sum x)^2}{N(N - 1)} \qquad (6b)$$


For purposes of acceptance inspection, however, it is not necessary (as 
will be seen later) actually to compute the standard deviation. 
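
That formulas (5), (6a), and (6b) give the same variance can be confirmed on a small example. This is a sketch with hypothetical measurements; exact rational arithmetic (`fractions.Fraction`) is an assumption made here so that the three results can be compared for strict equality.

```python
from fractions import Fraction

def variance_definition(xs):
    """Formula (5): sum of squared deviations from the mean over N - 1."""
    n = len(xs)
    x_bar = sum(xs) / n
    return sum((x - x_bar) ** 2 for x in xs) / (n - 1)

def variance_6a(xs):
    """Formula (6a): (sum x^2 - (sum x)^2 / N) / (N - 1)."""
    n = len(xs)
    return (sum(x * x for x in xs) - sum(xs) ** 2 / n) / (n - 1)

def variance_6b(xs):
    """Formula (6b): (N sum x^2 - (sum x)^2) / (N (N - 1))."""
    n = len(xs)
    return (n * sum(x * x for x in xs) - sum(xs) ** 2) / (n * (n - 1))

# Hypothetical measurements, held as exact fractions.
data = [Fraction(v) for v in ("15.2", "15.9", "14.8", "15.5", "15.1")]
print(variance_definition(data) == variance_6a(data) == variance_6b(data))
```

The short-cut forms need only the running totals of x and of x squared, which suits hand or desk-machine work. In floating-point arithmetic they can lose precision when the mean is large relative to the spread, which is why exact fractions are used in this check.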

For concreteness it is assumed that defective items are those for which
the value of the quality characteristic exceeds some upper bound of
satisfactory quality U. It would be equally appropriate to assume that
defective items are those below some minimum M. In that case, equations
(1), (2), and (3) would be

$$K = \frac{\mu - M}{\sigma} \qquad (1')$$

$$\mu - K\sigma \ge M \qquad (2')$$

$$\bar{x} - ks \ge M \qquad (3')$$

In what follows, the assumption is retained that defectives are items that
are too large, but from equations (1'), (2'), and (3') the application to
cases where defectives are items that are too small will be obvious.

3. THREE PHASES OF A VARIABLES SAMPLING PLAN 
3.1. The Three Phases 

A sampling procedure may be analyzed into three phases: 

(1) The plan of action, that is, the set of rules on the basis of which 
to accept or reject the lot.* 

(2) The amount of inspection required by the plan, that is, the number 
of items that must be inspected from each lot. 

(3) The operating characteristics of the plan, that is, the proportions
of submitted lots of various qualities that will be accepted and rejected
if the plan is used.

* Some plans allow a third decision, to inspect more items: see Sec. 11.6.

3.2. Plan of Action 

The plan of action for variables inspection for percent defective is as 
follows: 

a. A sample of N items is selected at random from the lot. 

b. The quality characteristic being inspected is measured and recorded 
for each item of the sample. 

c. The mean and standard deviation are computed from the N 
measurements. 

d. The sample standard deviation is multiplied by a certain constant 
k and added to the sample mean. 

e. The lot is rejected if the value of $\bar{x} + ks$ found in step d exceeds U,
the limit of the quality characteristic above which an item is considered
defective; otherwise, the lot is accepted.
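
Steps c to e can be sketched in a few lines once the N measurements of steps a and b are in hand. The function name, the sample readings, the limit of 16, and the two multipliers below are hypothetical; the standard deviation uses the N - 1 divisor of formula (5).

```python
from statistics import mean, stdev

def inspect_lot(measurements, k, upper_limit):
    """Steps c to e: return 'accept' or 'reject' for one sample."""
    x_bar = mean(measurements)          # step c: sample mean
    s = stdev(measurements)             # step c: sample standard deviation (N - 1 divisor)
    statistic = x_bar + k * s           # step d
    return "reject" if statistic > upper_limit else "accept"   # step e

# Hypothetical sample of volatile-content readings (percent), with U = 16.
sample = [15.2, 15.9, 14.8, 15.5, 15.1, 15.7, 15.3, 15.0]
print(inspect_lot(sample, k=1.5, upper_limit=16.0))
print(inspect_lot(sample, k=2.5, upper_limit=16.0))
```

The same sample passes under the smaller multiplier and fails under the larger one, which shows how k controls the severity of the plan.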

In practice, steps c and d can be replaced by a single step that consists 
essentially of adding the squared values of the N sample measurements; 
in that case step e remains essentially as stated, though the actual 
numbers used in it are different. 

Thus, the plan of action is characterized by two parameters: the 
sample size N and the multiplier of the standard deviation k. (U is
associated with the definition of quality rather than with the sampling 
plan, since it simply defines a defective item.) 

3.3. Amount of Inspection 

Under the plans considered here, which are single-sampling plans, the 
number of items inspected per lot is always the same for a given plan. 

Single-sampling plans for fraction defective based on attributes can 
sometimes be curtailed, that is, terminated without inspecting all N 
items. Curtailing is possible as soon as the number of defectives reaches 
the rejection number or the number of defectives plus the number of 
items remaining to be inspected is less than the rejection number. Plans 
based on variables can also be curtailed, provided there is some minimum 
possible value of the characteristic being inspected. Most acceptance 
inspection involves quantities that can under no circumstances be less 
than zero, so curtailing before inspecting all N items is technically possible.
The simplest example of this possibility is the case in which, with
k positive, the sum of fewer than N measurements exceeds NU, making
it clear that for the entire N items $\bar{x}$ will exceed U and $\bar{x} + ks$ will exceed
it still further. That there are other cases, less extreme but more complicated
to describe, in which curtailing is possible can be seen by studying
Sec. 11.3 below. Ordinarily, however, curtailing is not considered
because computations are not started until all measurements have been
made. When it is possible to carry out simple computations while the 
data are being collected and it is important to minimize the amount of 
inspection, sequential analysis of the data will be considerably more 
efficient than merely curtailing a single-sampling plan. A brief outline 
of a plan for sequential analysis of fraction defective by variables is 
presented in Sec. 11.6. 
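
The simplest curtailing rule described above can be sketched as follows. The function name and the figures (N = 10, U = 16, and the readings) are hypothetical; the rule covers only the extreme case named in the text and assumes k positive and measurements that cannot be negative.

```python
def can_reject_early(partial_measurements, n, upper_limit):
    """True once the running sum of nonnegative readings exceeds N * U.

    Assumes k > 0 and that no measurement can be negative, so the final
    sample mean must exceed U whatever the remaining items show.
    """
    return sum(partial_measurements) > n * upper_limit

# Hypothetical plan: N = 10 items, U = 16, so N * U = 160.
print(can_reject_early([40.0, 45.0, 38.0, 42.0], n=10, upper_limit=16.0))  # sum is 165
print(can_reject_early([17.0, 18.0, 16.5, 17.5], n=10, upper_limit=16.0))  # sum is 69
```

In the first case rejection is already certain after four items; in the second, inspection must continue.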

3.4. Operating Characteristics (OC) 

There are evidently many possible plans of action, for each different
combination of sample size and multiplier of the standard deviation
gives a different plan of action. If a large number of lots containing a
given fraction defective p are submitted to a given plan, a certain proportion
$L_p$ are accepted; and this proportion differs from one plan to
another. For each plan of action, it is important to know what proportion
of submitted lots will be accepted for each possible quality p of
submitted lots, as a plan is clearly unsuitable if it passes too many of the
lots of unsatisfactory quality or rejects too many of the lots of acceptable
quality that are submitted to it.

Fig. 1.3.—Typical operating-characteristic (OC) curve.

This information is shown by the operating-characteristic (OC) curve
of the plan. The ordinate of the OC curve shows the proportion $L_p$ of
lots that will be accepted when the fraction defective in submitted lots
is p. The proportion of submitted lots of quality p that will be rejected
is $1 - L_p$. The OC curve of Fig. 1.3 is typical: it shows a probability of
1 of accepting a lot having no defectives. As the fraction defective
increases, the probability of acceptance declines, slowly at first, then more
rapidly, and finally flattens out. It shows a probability of 0 of accepting
a lot having only defectives.
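
The shape of an OC curve can be approximated by simulation. The sketch below is not the method of this chapter; it is a Monte Carlo illustration under stated assumptions: lots are exactly normal, each lot is standardized to mean 0 and standard deviation 1 so that the limit equals the normal deviate exceeded with probability p, and the plan (N = 20, k = 1.9) is hypothetical.

```python
import random
from statistics import NormalDist, mean, stdev

def simulated_oc(p, n, k, trials=2000, seed=1):
    """Estimate L_p, the proportion of lots of fraction defective p accepted."""
    rng = random.Random(seed)
    # Standardized lot (mean 0, sd 1): the limit U then equals K_p.
    upper_limit = NormalDist().inv_cdf(1.0 - p)
    accepted = 0
    for _ in range(trials):
        sample = [rng.gauss(0.0, 1.0) for _ in range(n)]
        if mean(sample) + k * stdev(sample) <= upper_limit:
            accepted += 1
    return accepted / trials

# Hypothetical plan: N = 20, k = 1.9.
for p in (0.005, 0.02, 0.05, 0.15):
    print(p, simulated_oc(p, n=20, k=1.9))
```

The estimated proportions should fall from near 1 for very good lots toward 0 for poor ones, tracing the general form of Fig. 1.3, subject to simulation error.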

It should be emphasized that the OC curve does not show the probability
that an accepted lot will be of quality p; this depends upon the
probability that a lot of quality p will be submitted, as well as upon the 
probability that a lot of quality p will be accepted if submitted. The OC 
curve shows only the probability that a lot of quality p will be accepted 
if submitted. 
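
A small numerical sketch may make the distinction concrete. All figures are hypothetical: two lot qualities, assumed OC ordinates of 0.95 and 0.10 for them, and two assumed mixes of submitted lots. The function name is illustrative only.

```python
def share_of_accepted_lots(submission_probs, oc_values):
    """Split of accepted lots by quality, from submission mix and OC ordinates."""
    joint = [q * l for q, l in zip(submission_probs, oc_values)]
    total = sum(joint)                    # overall proportion of lots accepted
    return [j / total for j in joint]

# Hypothetical OC ordinates for a good quality and a bad quality.
oc = [0.95, 0.10]
print(share_of_accepted_lots([0.9, 0.1], oc))  # mostly good lots submitted
print(share_of_accepted_lots([0.5, 0.5], oc))  # good and bad equally common
```

With the OC ordinates held fixed, the share of bad lots among those accepted changes with the mix submitted, which is why the OC curve alone cannot supply it.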

The OC curve may be thought of in another way, which, though it
requires visualizing a three-dimensional chart, is illuminating. Consider
Fig. 1.2, in which the lot mean $\mu$ is plotted on the horizontal axis and
the lot standard deviation $\sigma$ on the vertical axis. Any point in this
figure for which $\sigma > 0$ represents a possible lot. Imagine a line perpendicular
to the plane of the page through some point $\mu,\sigma$. The height of
the perpendicular represents the probability $L_{\mu,\sigma}$ that, if a lot whose mean
and standard deviation correspond to the point $\mu,\sigma$ is submitted, the plan
of action will accept it. If an OC surface is constructed by erecting such
perpendiculars at all possible combinations of $\mu$ and $\sigma$, it will be found
that above any straight line in the $\mu$-$\sigma$ plane passing through U (as for
example, the sloping line in Fig. 1.2) all the values of $L_{\mu,\sigma}$ are equal.
Since all lots represented by a straight line through U have the same
value of K, this means that all lots having the same value of K have the
same probability of acceptance. Hence the OC surface can be reduced
to an OC curve in two dimensions, the probability of acceptance being
plotted against the value of K. But, as was pointed out in Sec. 2, all
lots having the same value of K have the same fraction defective. Thus
the OC curve can show the probability of acceptance as a function of p,
as in Fig. 1.3.
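
The claim that lots with the same K share the same probability of acceptance can be illustrated by simulation. The sketch assumes exactly normal lots; the two lots, the limit of 16, and the plan (N = 15, k = 1.8) are hypothetical. Both runs deliberately reuse the same seed, so they are built from the same standardized draws and differ only in scale and location.

```python
import random
from statistics import mean, stdev

def acceptance_rate(mu, sigma, upper_limit, n, k, trials=2000, seed=7):
    """Proportion of simulated samples from a normal lot that pass criterion (3)."""
    rng = random.Random(seed)
    accepted = 0
    for _ in range(trials):
        sample = [mu + sigma * rng.gauss(0.0, 1.0) for _ in range(n)]
        if mean(sample) + k * stdev(sample) <= upper_limit:
            accepted += 1
    return accepted / trials

# Two hypothetical lots with the same K = (U - mu) / sigma = 2 but different mu and sigma.
rate_a = acceptance_rate(mu=14.0, sigma=1.0, upper_limit=16.0, n=15, k=1.8)
rate_b = acceptance_rate(mu=10.0, sigma=3.0, upper_limit=16.0, n=15, k=1.8)
print(rate_a, rate_b)
```

With a shared seed the two rates should agree almost exactly; with independent draws they would agree only up to sampling error.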


4. PLAN OF ACTION 

4.1. General Procedure 

The first step in selecting the values of k and N that determine a
particular plan is to select two points through which it is desired that the
OC curve pass. The first point corresponds to a lot fraction defective $p_1$
for which it is desired that the probability of rejection be not greater
than $\alpha$. The second point corresponds to a lot fraction defective $p_2$
($p_2 > p_1$) for which it is desired that the probability of acceptance be
only $\beta$ ($\alpha + \beta < 1$). Fractions defective of $p_1$ or less thus represent
acceptable quality; $\alpha$ is the maximum producer’s risk, that is, the maximum
risk that an acceptable lot will be rejected by the sampling plan.
Similarly, fractions defective of $p_2$ or more represent unacceptable quality;
$\beta$ is the maximum consumer’s risk, that is, the maximum risk that
an unacceptable lot will be accepted by the sampling plan. For lots of
quality better than $p_1$ (fractions defective less than $p_1$) the producer’s
risk will be less than $\alpha$, and for lots of quality worse than $p_2$ (fractions
defective more than $p_2$) the consumer’s risk will be less than $\beta$.

The two points on the OC curve thus correspond with

\[ p = p_1, \qquad L_{p_1} = 1 - \alpha \tag{7} \]

\[ p = p_2, \qquad L_{p_2} = \beta \tag{8} \]

From these, k and N can be computed. If N proves too large, it is
necessary to select two points on a less discriminating OC curve. The
more discriminating, that is, the steeper, the OC curve, the larger N;
consequently, N may be reduced by widening the gap between p_1 and p_2
or by increasing the sum of the risks α + β, either of which results in
a flatter OC curve.


4.2. Formulas for k and N

Let K_ε, where 0 < ε < 1, represent a normal deviate exceeded with
probability ε; that is, K_ε is defined by

\[ \frac{1}{\sqrt{2\pi}} \int_{K_\epsilon}^{\infty} e^{-x^2/2}\,dx = \epsilon \tag{9} \]

Values of K_ε corresponding with various common values of ε are given in
Table 1.1. Four values of ε, namely, p_1, p_2, α, and β, are of particular
interest; and the corresponding values of K_ε may be designated K_1, K_2,
K_α, and K_β.

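The value of k may be computed from the formula

\[ k = \frac{K_\alpha K_2 + K_\beta K_1}{K_\alpha + K_\beta} \tag{10} \]

The sample size can be determined either from

\[ N = \frac{2(K_\alpha + K_\beta)^2 + (K_\alpha K_2 + K_\beta K_1)^2}{2(K_1 - K_2)^2} \tag{11} \]

or from

\[ N = \frac{k^2 + 2}{2} \left( \frac{K_\alpha + K_\beta}{K_1 - K_2} \right)^2 \tag{12} \]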


The value of N given by (11) and (12) ordinarily will not be an integer;
it should be rounded to the next higher integer, not to the nearest integer.*
Note that k is a weighted average of K_1 and K_2, the weights being
K_β and K_α, respectively. Thus, if both α and β are less than 0.5, k is
between K_1 and K_2; and if α and β are equal, k = (K_1 + K_2)/2.
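Formulas (10) and (12) can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function names normal_deviate and plan_k_and_n are hypothetical, and the standard-library class statistics.NormalDist is assumed as a substitute for reading K_ε from Table 1.1.

```python
from math import ceil
from statistics import NormalDist


def normal_deviate(eps):
    """Return K_eps, the normal deviate exceeded with probability eps."""
    return NormalDist().inv_cdf(1.0 - eps)


def plan_k_and_n(p1, p2, alpha, beta):
    """Sketch of formulas (10) and (12).

    Returns (k, unrounded N, N rounded to the next higher integer).
    """
    k1, k2 = normal_deviate(p1), normal_deviate(p2)
    ka, kb = normal_deviate(alpha), normal_deviate(beta)
    k = (ka * k2 + kb * k1) / (ka + kb)                       # formula (10)
    n = (k * k + 2.0) / 2.0 * ((ka + kb) / (k1 - k2)) ** 2    # formula (12)
    return k, n, ceil(n)                                      # round N upward


if __name__ == "__main__":
    # Hypothetical call; any p1 < p2 with alpha + beta < 1 may be used.
    k, n, n_up = plan_k_and_n(0.15, 0.30, 0.01, 0.02)
    print(f"k = {k:.4f}, N = {n:.1f}, use N = {n_up}")
```

Because the deviates are computed to full precision rather than read to 4 decimal places, the last digit of k or N may differ slightly from a hand computation.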

* Two more formulas for N are given by equations (134) and (135), Sec. 11.1.2,

TABLE 1.1

Normal Deviates Exceeded with Given Probability

Values of K_ε for various ε, where

\[ \frac{1}{\sqrt{2\pi}} \int_{K_\epsilon}^{\infty} e^{-x^2/2}\,dx = \epsilon \]

  ε       0.00       0.0        0.1        0.2        0.3        0.4

  0        ∞          ∞       1.28155    0.84162    0.52440    0.25335
  1     3.09023    2.32635    1.22653    0.80642    0.49585    0.22754
  2     2.87816    2.05375    1.17499    0.77219    0.46770    0.20189
  3     2.74778    1.88079    1.12639    0.73885    0.43991    0.17637
  4     2.65207    1.75069    1.08032    0.70630    0.41246    0.15097
  5     2.57583    1.64485    1.03643    0.67449    0.38532    0.12566
  6     2.51214    1.55477    0.99446    0.64335    0.35846    0.10043
  7     2.45726    1.47579    0.95417    0.61281    0.33185    0.07527
  8     2.40892    1.40507    0.91537    0.58284    0.30548    0.05015
  9     2.36562    1.34076    0.87790    0.55338    0.27932    0.02507

When ε > 0.5, K_ε = -K_{1-ε}. When ε = 0.5, K_0.5 = 0.

The entries in column (ε) are additional digits for the values of ε shown in the top
row. Thus, the entry 1.12639 in column (0.1) and row (3) corresponds with ε = 0.13;
similarly, the entry 2.74778 in column (0.00) and row (3) corresponds with ε = 0.003.

A final 5 in italic indicates that, in rounding, the nearest odd digit should be used
in the fourth decimal place. When a final 5 is not in italic, follow the usual rule of
rounding 0.5 to the nearest even integer. Thus a final pair 55 becomes 6, whereas
with an italic 5 a final pair 35 becomes 3 and 85 becomes 9.

For a detailed table see The Kelley Statistical Tables; Table I gives values of K_ε
(denoted by x) to 8 decimal places for values of ε (denoted by q) from 0 to 0.5 by
steps of 0.0001. Alternatively, see Fisher and Yates, Statistical Tables; Table I gives
values of K_ε (denoted by x) to 6 decimal places for values of ε (denoted by P/2) from
0 to 0.495 by steps of 0.005; also Table IX gives values of K_ε + 5 to 4 decimal places
for values of 1 - ε from 0 to 0.980 by steps of 0.001 and from 0.9800 to 0.9999 by
steps of 0.0001.
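The reading rule for Table 1.1 (column heading plus row digit gives ε) can be mimicked in code. The sketch below is an assumption-laden illustration, not part of the original tables: the function name table_entry is hypothetical, and statistics.NormalDist from the Python standard library is assumed to stand in for the tabulated values.

```python
from math import inf
from statistics import NormalDist


def table_entry(column, row):
    """Return K_eps for the Table 1.1 cell in the given column and row.

    column is the heading as a string ("0.00", "0.0", "0.1", ...);
    row is the additional digit 0-9, so ("0.1", 3) means eps = 0.13.
    """
    eps = float(column + str(row))     # append the row digit to the heading
    if eps == 0.0:
        return inf                     # a probability of zero gives an infinite deviate
    if eps > 0.5:
        # K_eps = -K_{1-eps}; not needed for the table itself, shown for completeness
        return -NormalDist().inv_cdf(eps)
    return NormalDist().inv_cdf(1.0 - eps)


if __name__ == "__main__":
    for column, row in [("0.1", 3), ("0.00", 3), ("0.4", 9)]:
        print(column, row, round(table_entry(column, row), 5))
```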


4.3. Illustration 

To illustrate the use of equations (10), (11), and (12), suppose that
lots are considered acceptable if the percent defective is 15 or less and
unacceptable if the percent defective is 30 or more, and suppose the
maximum producer's and consumer's risks are to be 1 percent and 2 percent,
respectively. Then

\[ p_1 = 0.15, \qquad \alpha = 0.01 \tag{13} \]

\[ p_2 = 0.30, \qquad \beta = 0.02 \tag{14} \]

From Table 1.1, using only 4 decimal places, we find

\[ K_1 = 1.0364, \qquad K_\alpha = 2.3263 \tag{15} \]

\[ K_2 = 0.5244, \qquad K_\beta = 2.0537 \tag{16} \]

Substituting from (15) and (16) into (10) gives

\[ k = \frac{2.3263 \times 0.5244 + 2.0537 \times 1.0364}{2.3263 + 2.0537} = \frac{3.3484}{4.3800} = 0.7645 \tag{17} \]

Substituting from (15) and (16) into (11) gives

\[ N = \frac{2 \times 4.3800^2 + 3.3484^2}{2 \times 0.5120^2} = 94.6 \tag{18} \]

The same value of N results from substituting from (15), (16), and (17)
into (12):

\[ N = \frac{0.7645^2 + 2}{2} \left( \frac{4.3800}{0.5120} \right)^2 = 1.2922 \times 73.1829 = 94.6 \tag{19} \]

Thus, in this case a sample of 95 items would be drawn at random
from the lot and a measurement of the characteristic being inspected
would be made on each item of the sample. The sum of the measurements
and the sum of their squares would be computed. These sums
would be substituted into equations (4) and (6) to find the sample mean
and standard deviation, x̄ and s. Finally, x̄ + 0.7645s would be computed;
if it exceeded U (the value of the quality characteristic above
which an item is considered defective) the lot would be rejected, otherwise
the lot would be accepted. (Actually, the abbreviated computational
procedures described in Sec. 7 might be used.)
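The decision rule just described can be sketched as follows. The function name accept_lot and the sample data are hypothetical, and the sketch assumes that the sample standard deviation of equation (6) uses the divisor N - 1, which is what statistics.stdev computes; equations (4) and (6) themselves are not reproduced here.

```python
from statistics import mean, stdev


def accept_lot(measurements, k, upper_limit):
    """Return (accepted, statistic) for the criterion x_bar + k*s versus U.

    The lot is rejected when x_bar + k*s exceeds the upper limit U and
    accepted otherwise.
    """
    x_bar = mean(measurements)
    s = stdev(measurements)            # assumption: divisor N - 1
    statistic = x_bar + k * s
    return statistic <= upper_limit, statistic


if __name__ == "__main__":
    # Made-up measurements, far fewer than the 95 the plan would require.
    sample = [9.8, 10.1, 10.0, 9.9, 10.2]
    print(accept_lot(sample, k=0.7645, upper_limit=10.5))
```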

4.4. Adjustment of N

If the sample size given by (11) and (12) is too large, two new points
may be selected in such a way that the OC curve will be less steep. For
example, if p_1 and p_2 are kept at 15 and 30 percent but α and β are
doubled, to 2 and 4 percent, N is reduced to 72; or if α and β are kept at
1 and 2 percent but p_1 and p_2 are changed to 10 and 35 percent, N becomes
32.
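The effect of such adjustments can be explored numerically. The sketch below applies formula (11) to the original plan and to the two relaxed plans mentioned above; the function name sample_size is hypothetical, and statistics.NormalDist is assumed in place of Table 1.1.

```python
from math import ceil
from statistics import NormalDist


def sample_size(p1, p2, alpha, beta):
    """Unrounded N from formula (11)."""
    def deviate(eps):
        return NormalDist().inv_cdf(1.0 - eps)

    k1, k2, ka, kb = deviate(p1), deviate(p2), deviate(alpha), deviate(beta)
    numerator = 2.0 * (ka + kb) ** 2 + (ka * k2 + kb * k1) ** 2
    return numerator / (2.0 * (k1 - k2) ** 2)


if __name__ == "__main__":
    plans = {
        "original plan": (0.15, 0.30, 0.01, 0.02),
        "risks doubled": (0.15, 0.30, 0.02, 0.04),
        "gap widened": (0.10, 0.35, 0.01, 0.02),
    }
    for label, args in plans.items():
        n = sample_size(*args)
        print(f"{label}: N = {n:.1f}, rounded up to {ceil(n)}")
```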


4.5. Adjustment of k with N Fixed 

4.5.1. Formula. If for some reason N is definitely fixed, the OC curve
can be made to pass through any one point desired, say

\[ p = p_1, \qquad L_{p_1} = 1 - \alpha \tag{20} \]

by setting

\[ k = \frac{K_1 \pm \sqrt{K_1^2 - ab}}{a} \tag{21} \]

where the plus sign is to be used if α > 0.5 and the minus sign if α < 0.5,
and where

\[ a = 1 - \frac{K_\alpha^2}{2(N - 1)} \tag{22} \]

and

\[ b = K_1^2 - \frac{K_\alpha^2}{N} \tag{23} \]

4.5.2. Example. Suppose, for example, that a sample of 95 has been
decided upon, and a sampling plan is desired such that the OC curve
passes through the point defined by (14). Then (20) becomes

\[ p_1 = 0.30, \qquad L_{p_1} = 0.02 \tag{24} \]

and from Table 1.1 (with α = 0.98, so that K_α = -K_0.02) we find

\[ K_1 = 0.5244, \qquad K_\alpha = -2.0537 \tag{25} \]

Hence

\[ a = 1 - \frac{4.2177}{188} = 0.9776 \tag{26} \]

\[ b = 0.2750 - \frac{4.2177}{95} = 0.2306 \tag{27} \]

\[ ab = 0.2254 \tag{28} \]

\[ K_1^2 - ab = 0.2750 - 0.2254 = 0.0496 \tag{29} \]

\[ \sqrt{K_1^2 - ab} = 0.2227 \tag{30} \]

\[ k = \frac{0.5244 + 0.2227}{0.9776} = 0.7642 \tag{31} \]

This differs slightly from the value 0.7645 yielded by (17) for a sample
of 95 and an OC curve passing through the points defined by (13) and
(14). The value of k yielded by (31) would be equal (except for differences
due to rounding) to that yielded by (17) if in (22) and (23) both N
and N - 1 were taken as 94.6.
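Formulas (21) to (23) translate directly into code. The following is a minimal sketch under stated assumptions: the function name k_for_fixed_n is hypothetical, and statistics.NormalDist is assumed in place of Table 1.1, so results may differ in the last digit from values computed with 4-place deviates.

```python
from math import sqrt
from statistics import NormalDist


def k_for_fixed_n(n, p, alpha):
    """k for which the OC curve passes through (p, 1 - alpha) when N is fixed.

    Sketch of formulas (21)-(23); alpha is the probability of rejection
    wanted at lot quality p.
    """
    std = NormalDist()
    k_p = std.inv_cdf(1.0 - p)             # K_1 in the text
    k_a = std.inv_cdf(1.0 - alpha)         # K_alpha, negative when alpha > 0.5
    a = 1.0 - k_a ** 2 / (2.0 * (n - 1))   # formula (22)
    b = k_p ** 2 - k_a ** 2 / n            # formula (23)
    root = sqrt(k_p ** 2 - a * b)
    if alpha > 0.5:
        return (k_p + root) / a            # plus sign in (21)
    return (k_p - root) / a                # minus sign in (21)


if __name__ == "__main__":
    # The point (p = 0.30, L = 0.02) corresponds to alpha = 0.98.
    print(round(k_for_fixed_n(95, 0.30, 0.98), 4))
```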

4.5.3. A Special Case: Indifference Quality. If N is fixed and k is to
be selected so that the OC curve will pass through one specified point,
this point might reasonably be chosen at the indifference quality p_0.
The indifference quality is the quality at which it is a matter of complete
indifference whether the lot is accepted or rejected.* Lots of better
quality should be accepted more often than rejected, lots of worse quality
should be rejected more often than accepted, and lots of the indifference
quality should be accepted and rejected equally frequently. Thus, α in
(20) becomes 0.5, K_α becomes 0, and (21) reduces to k = K_0, the normal
deviate exceeded with probability equal to the indifference quality.

* For a discussion of the indifference quality as a key to the selection of
sampling-inspection plans, see Statistical Research Group, Sampling Inspection,
Chap. 15, Sec. 2.2.1.3, p. 145.

If k is determined from the indifference quality, it does not depend
upon N, for the term in (22) and the term in (23) that involve N drop
out when K_α = 0. Thus if k is determined from the indifference quality,
changes in N will not change the value of p for which acceptance and
rejection are equally likely, nor will they require changes in k. Roughly
speaking, changes in N affect the steepness of the curve, while changes in
k affect its location with respect to the quality scale.
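The separation of roles between k and N can be checked numerically. The sketch below is illustrative only: the function names are hypothetical, statistics.NormalDist is assumed in place of the tables, and the acceptance probability is taken from the approximate relation given later as formula (44) in Sec. 6.3.

```python
from math import sqrt
from statistics import NormalDist

STD = NormalDist()


def k_from_indifference_quality(p0):
    """k = K_0, the normal deviate exceeded with probability p0."""
    return STD.inv_cdf(1.0 - p0)


def approx_acceptance_probability(p, n, k):
    """Approximate L_p based on formula (44) of Sec. 6.3."""
    spread = sqrt(1.0 / n + k * k / (2.0 * (n - 1)))
    k_l = (k - STD.inv_cdf(1.0 - p)) / spread   # K_{L_p}
    return 1.0 - STD.cdf(k_l)                   # probability of exceeding K_{L_p}


if __name__ == "__main__":
    k = k_from_indifference_quality(0.2223)
    for n in (10, 95, 1000):
        # At p0 the probability stays at 0.5; at p = 0.15 it rises with N.
        print(n, approx_acceptance_probability(0.2223, n, k),
              round(approx_acceptance_probability(0.15, n, k), 4))
```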

4.6. Approximations in the Plan 

Three kinds of approximation are involved in using formulas (10),
(11), and (12). First, there is the approximation involved in assuming
the quality characteristic to be normally distributed. Second, certain
mathematical approximations are involved in the derivation of the
formulas (see Sec. 11.2). Third, the value of N given by formulas (11)
and (12) is not ordinarily an integer, so an approximation is involved in
substituting an integer. The effect of the first approximation is discussed
later (Sec. 9.4). Even if the quality characteristic were exactly
normally distributed, the result of the second and third approximations 
would be that the true OC curve would not pass precisely through the 
points intended; but it would come sufficiently close for all practical 
purposes. In the present illustration, for example, taking N = 95 and 
k = 0.7645 results in a producer's risk of 1 percent at a percent defective 
of 15.04 instead of 15. The inaccuracies due to the second and third 
approximations tend to be larger for smaller samples, both because the 
effect of the mathematical approximations is larger and because the 
effect of rounding N is, on the average, larger; this point is discussed 
further in Secs. 4.7 and 11.2. 

4.7. Table of N and k 

Table 1.2 contains values of N and k for 151 combinations of p_1 and p_2.
The values of p_1 in the table range from 0.001 to 0.05; the values of p_2
vary from about 1.5 p_1 up to 10 p_1 or more. The values of α are
in all cases 0.05 and the values of β are 0.10. The values of N and k have
been computed by methods explained in Sec. 11.5 that do not involve
the mathematical approximations used in formulas (10) and (11) or (12),
so they may be slightly different from, and more accurate than, the
values given by (10) and (11) or (12). This is especially true when N
is small, not only because of the mathematical approximations underlying
(10) and (11) or (12), but also because of the rounding of N. For
example, if p_1 = 0.04, p_2 = 0.40, α = 0.05, and β = 0.10, Table 1.2
shows N = 6 and k = 0.9480. The table also shows that for these values
of N and k the true values of p_1 and p_2, which, because N must be an
integer, cannot be made precisely equal to the intended values, are
0.0412 and 0.3963. Formulas (10) and (11) yield for this example
k = 0.9091 and N = 5.4, which is rounded upward to N = 6; for N = 6
and k = 0.9091 the true p_1 and p_2 are 0.0457 and 0.4073. If k is adjusted
by formula (21), taking N = 6, p = 0.04, and α = 0.05, it becomes
0.9247. For this adjusted value of k the true p_1 and p_2 are 0.0439 and
0.4029. Thus, even for this case where N is only 6, it is questionable
whether the greater accuracy of Table 1.2 or of the adjusted k, as compared
with formulas (10) and (11) or (12), is of substantial value for most
practical purposes; and for larger values of N the differences are even less
important.

TABLE 1.2

Values of N and k for Various Combinations of p_1 and p_2 when α = 0.05 and
β = 0.10

N is the sample size; k is the multiple of the standard deviation in the criterion
x̄ + ks ≤ U for accepting the lot, where U is the value above which the characteristic
being inspected is regarded as defective; p_1 is the acceptable quality level (AQL);
p_2 is the lot tolerance percent defective (LTPD); α = 0.05 is the producer's risk; and
β = 0.10 is the consumer's risk. The true p_1 and true p_2 are the values at which, for
the corresponding N and k, α = 0.05 and β = 0.10; the difference between the true
and intended values of p_1 and p_2 arises from the fact that N must be an integer.

This table has been computed with the aid of Table IV in Johnson and Welch,
"Applications of the Non-central t-Distribution." The entries for N, k, true p_1, and
true p_2 are therefore slightly more accurate than those given by formulas (10), (11)
or (12), and (42), respectively. See Sec. 11.5 on the construction of this table.

Intended   Intended                        True       True
  p_1        p_2         N         k       p_1        p_2

 0.001      0.0015     3177     3.0214    0.0010     0.0015
            0.002      1032     2.9711    0.0010     0.0020
            0.0025      567     2.9311    0.0010     0.0025
            0.003       381     2.8979    0.0010     0.0030
            0.004       226     2.8441    0.0010     0.0040
            0.005       161     2.8018    0.0010     0.0050
            0.006       125     2.7660    0.0010     0.0061
            0.007       103     2.7359    0.0010     0.0071
            0.008        87     2.7153    0.0010     0.0080
            0.009        76     2.6832    0.0010     0.0092
            0.01         68     2.6725    0.0010     0.0099
            0.012        56     2.6220    0.0010     0.0122
            0.015        45     2.5730    0.0010     0.0153
            0.02         34     2.5208    0.0010     0.0201
            0.025        28     2.4712    0.0010     0.0250
            0.03         24     2.4392    0.0010     0.0294
            0.035        21     2.3930    0.0010     0.0351
            0.04         19     2.3745    0.0010     0.0389
            0.05         16     2.3059    0.0010     0.0494
            0.06         14     2.2570    0.0010     0.0590

 0.0025     0.004      1677     2.7199    0.0025     0.0040
            0.005       736     2.6772    0.0025     0.0050
            0.006       443     2.6413    0.0025     0.0060
            0.0075      268     2.5967    0.0025     0.0075
            0.01        158     2.5376    0.0025     0.0101
            0.012       118     2.4979    0.0025     0.0121
            0.015        86     2.4576    0.0025     0.0149
            0.02         59     2.3875    0.0025     0.0202
            0.025        46     2.3410    0.0025     0.0249
            0.03         37     2.2981    0.0025     0.0301
            0.035        32     2.2610    0.0025     0.0348
            0.04         28     2.2354    0.0025     0.0391
            0.05         22     2.1724    0.0025     0.0501
            0.06         19     2.1321    0.0025     0.0585

 0.015      0.02       2035     2.1047    0.0150     0.0200
            0.025       603     2.0520    0.0150     0.0250
            0.03        310     2.0077    0.0150     0.0301
            0.035       197     1.9688    0.0151     0.0352
            0.04        141     1.9348    0.0151     0.0402
            0.045       108     1.9033    0.0151     0.0453
            0.05         87     1.8814    0.0149     0.0497
            0.06         61     1.8271    0.0151     0.0604
            0.07         47     1.7864    0.0151     0.0701
            0.08         38     1.7468    0.0152     0.0801
            0.09         32     1.7223    0.0148     0.0884
            0.10         27     1.6856    0.0149     0.0994
            0.11         24     1.6548    0.0152     0.1088
            0.12         21     1.6330    0.0149     0.1183
            0.13         19     1.6002    0.0153     0.1288
            0.14         17     1.5767    0.0152     0.1392
            0.15         15     1.5528    0.0148     0.1514

 0.02       0.03        835     1.9565    0.0200     0.0300
            0.035       417     1.9179    0.0200     0.0351
            0.04        260     1.8836    0.0201     0.0401
            0.045       182     1.8523    0.0201     0.0452
            0.05        138     1.8247    0.0201     0.0501
            0.06         90     1.7680    0.0205     0.0609
            0.07         65     1.7364    0.0199     0.0697
            0.08         50     1.6971    0.0199     0.0800
            0.09         41     1.6617    0.0200     0.0898
            0.10         34     1.6290    0.0200     0.1001
            0.11         29     1.5968    0.0201     0.1107
            0.12         26     1.5722    0.0203     0.1191
            0.13         23     1.5473    0.0203     0.1287
            0.15         18     1.5037    0.0196     0.1495
            0.17         15     1.4623    0.0195     0.1692
            0.20         12     1.4041    0.0196     0.1986

 0.03       0.04       1334     1.8077    0.0300     0.0400
            0.045       643     1.7766    0.0300     0.0451
            0.05        390     1.7483    0.0300     0.0501
            0.06        198     1.6979    0.0301     0.0601
            0.07        125     1.6539    0.0302     0.0702
            0.08         88     1.6083    0.0306     0.0812
            0.09         67     1.5844    0.0298     0.0896
            0.10         53     1.5517    0.0298     0.0998
            0.11         44     1.5236    0.0298     0.1092
            0.12         37     1.4955    0.0298     0.1194
            0.13         32     1.4704    0.0298     0.1290
            0.15         25     1.4224    0.0299     0.1485
            0.20         15     1.3149    0.0297     0.2013
            0.25         11     1.2281    0.0305     0.2477
            0.30          8     1.1553    0.0293     0.3007

 0.04       0.06        524     1.6406    0.0400     0.0601
            0.07        259     1.5964    0.0401     0.0701
            0.08        160     1.5569    0.0402     0.0802
            0.09        111     1.5209    0.0402     0.0903
            0.10         83     1.4920    0.0399     0.0997
            0.11         65     1.4621    0.0399     0.1098
            0.12         53     1.4348    0.0398     0.1196
            0.13         45     1.4112    0.0398     0.1286
            0.14         38     1.3793    0.0403     0.1402
            0.15         33     1.3565    0.0402     0.1500
            0.17         26     1.3195    0.0397     0.1681
            0.20         19     1.2585    0.0396     0.1988
            0.25         13     1.1647    0.0405     0.2491
            0.30          9     1.0967    0.0378     0.3027
            0.35          7     1.0274    0.0376     0.3524
            0.40          6     0.9480    0.0412     0.3963

 0.05       0.07        661     1.5499    0.0500     0.0701
            0.08        320     1.5102    0.0501     0.0801
            0.09        194     1.4740    0.0501     0.0902
            0.10        133     1.4408    0.0502     0.1004
            0.11         99     1.4096    0.0504     0.1104
            0.12         77     1.3826    0.0503     0.1202
            0.13         62     1.3567    0.0502     0.1302
            0.14         52     1.3365    0.0498     0.1389
            0.15         44     1.3129    0.0498     0.1488
            0.16         38     1.2857    0.0503     0.1597
            0.17         33     1.2636    0.0503     0.1700
            0.20         24     1.2047    0.0506     0.1985
            0.25         15     1.1189    0.0494     0.2499
            0.30         11     1.0386    0.0503     0.2981
            0.35          8     0.9633    0.0493     0.3537
            0.40          6     0.9162    0.0448     0.4052


5. AMOUNT OF INSPECTION 

5.1. Comparison with Attribute Inspection

If each item were classified simply defective or nondefective and the
acceptability of the lot judged by the number of defectives in a single
sample, a sample of 145 would be required for an OC curve passing
through the points defined by (13) and (14).* If the sampling were done
sequentially by attributes, the average number of items inspected would
be 20 for lots with no defectives, 63 for lots 15 percent defective, 133 for
lots 22 percent defective, 61 for lots 30 percent defective, and 7 for lots
100 percent defective.†


TABLE 1.3

Sample Sizes for Eight Pairs of Single-sampling Plans Based on Attributes
and Variables

α = 0.05, β = 0.10 for all plans
N_a = sample size for attributes
N_t = sample size for variables

  Plan                                                      Saving
 number     p_1      p_2       N_t       N_a     N_a/N_t   (N_a - N_t)/N_a,
                                                              percent

   1       0.001    0.002     1,032    12,476     12.09       91.7
   2       0.001    0.06         14        46      3.29       69.6
   3       0.01     0.015     1,231     4,185      3.40       70.6
   4       0.01     0.10         21        44      2.10       52.3
   5       0.05     0.07        661     1,199      1.81       44.9
   6       0.05     0.40          6        10      1.67       40.0
   7       0.15     0.20        323       493      1.53       34.5
   8       0.15     0.40         17        26      1.53       34.6

For attributes the sample sizes have been computed as explained in Chap. 7. For variables they
have been computed by the method described in Sec. 11.5 of this chapter, which sometimes gives values
slightly different from, and slightly more accurate than, those given by formulas (11) and (12).


* See Chap. 7, Sec. 3, equation (21) and first footnote on p. 260.
† See Statistical Research Group, Sequential Analysis, Sec. 2 (second revision),
where the example defined by equations (13) and (14) above is treated in some detail.
Sequential analysis also can be used with variables (see Sec. 11.6 below) and will
result in a reduction in the average amount of inspection similar to that with attributes,
roughly 40 to 60 percent (see Sequential Analysis, Appendix A, p. A.03).

The saving in the number of items inspected by using variables
instead of attributes depends partly upon the steepness of the OC curve
and partly upon its location. The steeper the OC curve, the greater the
saving from using variables. The better the lot qualities involved (that
is, the smaller the percentages defective), the greater the saving. Eight
comparisons of attribute and variables plans are shown in Table 1.3. In
each case, α = 0.05 and β = 0.10. N_a and N_t represent the sample sizes
for single-sampling plans based on attributes and variables, respectively.

5.2. Comparison with Variables Inspection When the Standard Deviation
Is Known

If the true standard deviation of the characteristic being inspected
is known, the proportion defective depends only upon the mean, being p_1
or less if the mean is U - K_1 σ or less, and p_2 or more if the mean is
U - K_2 σ or more.* If the standard deviation were known, a sample of
73 would be required for an OC curve passing through the points defined
by (13) and (14). The reduction of 22 observations (23 percent) from
the 95 needed when the standard deviation is not known thus represents
the savings in inspection yielded by knowledge of the standard deviation.†
This saving, like that yielded by sufficient knowledge of the form of
distribution to permit the use of variables instead of attributes, varies
with the OC curve. Comparisons between N_t, the sample size for the
variables plans with unknown mean and standard deviation, and N_m, the
sample size with known standard deviation, are shown in Table 1.4 for
eight plans defined by the same pairs of OC points used in Table 1.3. In
general, the percentage saving, computed before N_t and N_m are rounded
to integers, is 100k²/(k² + 2), but for small samples it may differ from
this on account of rounding.

TABLE 1.4

Sample Sizes for Eight Pairs of Single-sampling Variables Plans Based on
Known and Unknown Standard Deviations

α = 0.05, β = 0.10 for all plans
N_t = sample size with standard deviation unknown
N_m = sample size with standard deviation known

  Plan                                                      Saving
 number     p_1      p_2       N_t       N_m     N_t/N_m   (N_t - N_m)/N_t,
                                                              percent

   1       0.001    0.002     1,032       190      5.43       81.6
   2       0.001    0.06         14         4      3.50       71.4
   3       0.01     0.015     1,231       351      3.51       71.5
   4       0.01     0.10         21         8      2.62       61.9
   5       0.05     0.07        661       300      2.20       54.6
   6       0.05     0.40          6         4      1.50       33.3
   7       0.15     0.20        323       226      1.43       30.0
   8       0.15     0.40         17        14      1.21       17.6

For the known standard deviation, the sample sizes are given by 1/a, where a is defined by equation
(137), Sec. 11.1 of this chapter. For the unknown standard deviation they have been computed by the
method described in Sec. 11.5 of this chapter, which sometimes gives values slightly different from, and
slightly more accurate than, those given by formulas (11) and (12).

* The use of variables in inspection for proportion defective when the standard
deviation is known is discussed by Romig, Allowable Average in Sampling Inspection.

† Incidentally, 22 is the number of observations required for an OC curve satisfying
(13) and (14) if the true mean is known and only the standard deviation need be
used in testing percent defective. In general, if N_t is the number of observations
required when neither the mean nor the standard deviation is known, N_m the number
required when the standard deviation is known, and N_s the number required when
the mean is known, the relation N_t = N_m + N_s will hold; in fact, N_m = 2N_t/(k² + 2)
and N_s = k²N_t/(k² + 2). See Chap. 8 for methods of determining the sample size
N_s when inspection is based on the standard deviation; the sample size N_m, when
inspection is based on the mean, can be determined from equation (148) in Sec. 11.1
of this chapter: it is 1/a, where a is defined by equation (137).

6. OPERATING-CHARACTERISTIC CURVE

6.1. Introduction

The values of k and N may be obtained from (10) and (11) or (12)
so that the OC curve will pass through two specified points. They may
be obtained by selecting N arbitrarily and computing k from (21) so
that the OC curve will pass through one specified point. They may both
be determined arbitrarily. Regardless of how they are chosen, their
effects must be evaluated in terms of the OC curve to which they give
rise. How will they operate with lots of every quality that might conceivably
be offered? If a lot is good, what is the risk that the sample
from it will fail to meet the acceptance requirement? If a lot is bad,
what is the risk that the sample will satisfy the acceptance criterion?
The OC curve shows, for every possible lot quality p, the probability L_p
that a lot of that quality will, if submitted, be accepted on the basis of the
sampling plan.

6.2. Five-point OC Curve

Three points on the OC curve are easily obtained: (1) When there
are no defectives in the lot, acceptance is certain; (2) when the lot contains
only defectives, rejection is certain; (3) when the proportion of
defectives in the lot is p_0, such that K_{p_0} = k, acceptance and rejection are
equally likely. Thus three points on the OC curve are given by

\[ p = 0, \qquad L_0 = 1 \tag{32} \]

\[ p = p_0, \qquad L_{p_0} = 0.5 \tag{33} \]

\[ p = 1, \qquad L_1 = 0 \tag{34} \]

where p_0 is the probability that a normal deviate will exceed k. If k and
N have been determined by (10) and (11) or (12), the OC curve will also
pass through the two points

\[ p = p_1, \qquad L_{p_1} = 1 - \alpha \tag{35} \]

\[ p = p_2, \qquad L_{p_2} = \beta \tag{36} \]

A sketch of the OC curve made from these five points is sufficient for
many purposes. For the illustrative example of Sec. 4.3, for example,
we have

\[ p = 0, \qquad L_0 = 1 \tag{37} \]

\[ p_1 = 0.15, \qquad L_{0.15} = 0.99 \tag{38} \]

\[ p_0 = 0.2223, \qquad L_{0.2223} = 0.5 \tag{39} \]

\[ p_2 = 0.30, \qquad L_{0.30} = 0.02 \tag{40} \]

\[ p = 1, \qquad L_1 = 0 \tag{41} \]

Fig. 1.4. Operating-characteristic (OC) curve for a variables inspection plan for percent
defective; N = 95, k = 0.7645. [Data from Table 1.6, columns (6), (7), (8).]

The central three of these five points are represented by dots in Fig. 1.4;
the two extreme points are not shown because of the incompleteness of the
p scale.
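The five points can be assembled programmatically once k and the two design points are known. The sketch below is illustrative only: the function name five_point_oc is hypothetical, and statistics.NormalDist is assumed in place of Table 1.5 for finding p_0.

```python
from statistics import NormalDist


def five_point_oc(k, p1, alpha, p2, beta):
    """Return the five (p, L_p) points of Sec. 6.2, ordered by increasing p."""
    p0 = 1.0 - NormalDist().cdf(k)     # probability that a normal deviate exceeds k
    return [
        (0.0, 1.0),                    # no defectives: acceptance certain
        (p1, 1.0 - alpha),             # producer's-risk point
        (p0, 0.5),                     # indifference quality
        (p2, beta),                    # consumer's-risk point
        (1.0, 0.0),                    # all defective: rejection certain
    ]


if __name__ == "__main__":
    for p, l_p in five_point_oc(0.7645, 0.15, 0.01, 0.30, 0.02):
        print(f"p = {p:.4f}   L_p = {l_p:.2f}")
```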

6.3. Detailed OC Curve 

As many points on the OC curve as desired may be found from the
following formula, which expresses the normal deviate corresponding with
the lot quality in terms of the normal deviate corresponding with any
given probability of acceptance:

\[ K_p = k - K_{L_p} \sqrt{\frac{1}{N} + \frac{k^2}{2(N-1)}} \tag{42} \]

Formula (42) yields the normal deviate corresponding with p. The value
of p can be found from Table 1.5, which shows the probabilities of
exceeding various normal deviates.

If p is to be computed for a number of values of L_p in order to plot
the entire OC curve, it is convenient while computing (42) to compute

\[ K_{p'} = k + K_{L_p} \sqrt{\frac{1}{N} + \frac{k^2}{2(N-1)}} \tag{43} \]

where p' is the lot quality for which the probability of acceptance is
L_{p'} = 1 - L_p.

If it is desired to find the probability of acceptance for a lot of given
quality, (42) and (43) can be rewritten as

\[ K_{L_p} = \frac{k - K_p}{\sqrt{\dfrac{1}{N} + \dfrac{k^2}{2(N-1)}}} \tag{44} \]

\[ K_{L_{p'}} = -K_{L_p} \tag{45} \]

6.3.1. Illustrative Computation for One Point. To illustrate the use
of formulas (42) and (43), consider an acceptance-inspection plan defined
by N = 95 and k = 0.7645. For what lot quality is the probability of
rejection 1 percent?

We have

\[ N = 95, \qquad k = 0.7645, \qquad K_{0.99} = -2.32635 \tag{46} \]

Then

\[ \sqrt{\frac{1}{N} + \frac{k^2}{2(N-1)}} = 0.11677 \tag{47} \]

\[ K_{L_p} \sqrt{\frac{1}{N} + \frac{k^2}{2(N-1)}} = -0.27165 \tag{48} \]

Substituting into (42),

\[ K_p = 1.03615 \tag{49} \]

Interpolating linearly in Table 1.5, we find p = 15.01 percent.* Similarly,
on substituting into (43) we find K_{p'} = 0.49285; so p' = 31.11 percent
for L_{p'} = 0.01.

TABLE 1.5

Probabilities Corresponding with Given Normal Deviates

Values of ε for various K_ε, where

\[ \frac{1}{\sqrt{2\pi}} \int_{K_\epsilon}^{\infty} e^{-x^2/2}\,dx = \epsilon \]

 K_ε     0       1       2       3       4       5       6       7       8       9

 0.0   0.5000  0.4960  0.4920  0.4880  0.4840  0.4801  0.4761  0.4721  0.4681  0.4641
 0.1   0.4602  0.4562  0.4522  0.4483  0.4443  0.4404  0.4364  0.4325  0.4286  0.4247
 0.2   0.4207  0.4168  0.4129  0.4090  0.4052  0.4013  0.3974  0.3936  0.3897  0.3859
 0.3   0.3821  0.3783  0.3745  0.3707  0.3669  0.3632  0.3594  0.3557  0.3520  0.3483
 0.4   0.3446  0.3409  0.3372  0.3336  0.3300  0.3264  0.3228  0.3192  0.3156  0.3121
 0.5   0.3085  0.3050  0.3015  0.2981  0.2946  0.2912  0.2877  0.2843  0.2810  0.2776
 0.6   0.2743  0.2709  0.2676  0.2643  0.2611  0.2578  0.2546  0.2514  0.2483  0.2451
 0.7   0.2420  0.2389  0.2358  0.2327  0.2296  0.2266  0.2236  0.2206  0.2177  0.2148
 0.8   0.2119  0.2090  0.2061  0.2033  0.2005  0.1977  0.1949  0.1922  0.1894  0.1867
 0.9   0.1841  0.1814  0.1788  0.1762  0.1736  0.1711  0.1685  0.1660  0.1635  0.1611
 1.0   0.1587  0.1562  0.1539  0.1515  0.1492  0.1469  0.1446  0.1423  0.1401  0.1379
 1.1   0.1357  0.1335  0.1314  0.1292  0.1271  0.1251  0.1230  0.1210  0.1190  0.1170
 1.2   0.1151  0.1131  0.1112  0.1093  0.1075  0.1056  0.1038  0.1020  0.1003  0.0985
 1.3   0.0968  0.0951  0.0934  0.0918  0.0901  0.0885  0.0869  0.0853  0.0838  0.0823
 1.4   0.0808  0.0793  0.0778  0.0764  0.0749  0.0735  0.0721  0.0708  0.0694  0.0681
 1.5   0.0668  0.0655  0.0643  0.0630  0.0618  0.0606  0.0594  0.0582  0.0571  0.0559
 1.6   0.0548  0.0537  0.0526  0.0516  0.0505  0.0495  0.0485  0.0475  0.0465  0.0455
 1.7   0.0446  0.0436  0.0427  0.0418  0.0409  0.0401  0.0392  0.0384  0.0375  0.0367
 1.8   0.0359  0.0351  0.0344  0.0336  0.0329  0.0322  0.0314  0.0307  0.0301  0.0294
 1.9   0.0287  0.0281  0.0274  0.0268  0.0262  0.0256  0.0250  0.0244  0.0239  0.0233
 2.0   0.0228  0.0222  0.0217  0.0212  0.0207  0.0202  0.0197  0.0192  0.0188  0.0183
 2.1   0.0179  0.0174  0.0170  0.0166  0.0162  0.0158  0.0154  0.0150  0.0146  0.0143
 2.2   0.0139  0.0136  0.0132  0.0129  0.0125  0.0122  0.0119  0.0116  0.0113  0.0110
 2.3   0.0107  0.0104  0.0102  0.0099  0.0096  0.0094  0.0091  0.0089  0.0087  0.0084
 2.4   0.0082  0.0080  0.0078  0.0075  0.0073  0.0071  0.0069  0.0068  0.0066  0.0064
 2.5   0.0062  0.0060  0.0059  0.0057  0.0055  0.0054  0.0052  0.0051  0.0049  0.0048
 2.6   0.0047  0.0045  0.0044  0.0043  0.0041  0.0040  0.0039  0.0038  0.0037  0.0036
 2.7   0.0035  0.0034  0.0033  0.0032  0.0031  0.0030  0.0029  0.0028  0.0027  0.0026
 2.8   0.0026  0.0025  0.0024  0.0023  0.0023  0.0022  0.0021  0.0021  0.0020  0.0019
 2.9   0.0019  0.0018  0.0018  0.0017  0.0016  0.0016  0.0015  0.0015  0.0014  0.0014
 3.0   0.0013  0.0013  0.0013  0.0012  0.0012  0.0011  0.0011  0.0011  0.0010  0.0010

The entries in row (K_ε) are additional digits for the values of K_ε shown in the first
column. Thus, the entry 0.0934 in column (2) and row (1.3) corresponds with
K_ε = 1.32; similarly, the entry 0.4840 in column (4) and row (0.0) corresponds with
K_ε = 0.04.

A 5 in italic indicates that, in rounding, the nearest odd digit should be used in
the preceding decimal place. When a 5 is not in italic, follow the usual rule of
rounding to the nearest even digit. Thus a final pair 55 becomes 6, whereas with an
italic 5 a final pair 35 becomes 3 and 85 becomes 9.

For a detailed table see New York Mathematical Tables Project, Tables of Probability
Functions. Table I gives values of 1 - 2ε to 15 decimal places for values of
K_ε (denoted by x) from 0 to 1 by steps of 0.0001, and for all values of K_ε above 1 by
steps of 0.001.
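The single-point computation above can be reproduced with a short sketch of formulas (42) and (43). The function name lot_qualities is hypothetical, and statistics.NormalDist is assumed in place of Tables 1.1 and 1.5, so no linear interpolation is involved and the last digit may differ slightly from the hand computation.

```python
from math import sqrt
from statistics import NormalDist

STD = NormalDist()


def lot_qualities(accept_prob, n, k):
    """Return (p, p_prime) for a plan with sample size n and multiplier k.

    p has probability of acceptance accept_prob, formula (42);
    p_prime has probability of acceptance 1 - accept_prob, formula (43).
    """
    spread = sqrt(1.0 / n + k * k / (2.0 * (n - 1)))
    k_l = STD.inv_cdf(1.0 - accept_prob)          # K_{L_p}
    p = 1.0 - STD.cdf(k - k_l * spread)           # formula (42)
    p_prime = 1.0 - STD.cdf(k + k_l * spread)     # formula (43)
    return p, p_prime


if __name__ == "__main__":
    # Probability of rejection 1 percent means probability of acceptance 0.99.
    p, p_prime = lot_qualities(0.99, 95, 0.7645)
    print(f"p = {100 * p:.2f} percent, p' = {100 * p_prime:.2f} percent")
```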

6.3.2. Work Sheet for Detailed OC Curve. —When a number of points 
on the OC curve are needed, it is convenient to arrange the computations 
in a systematic way, as for example that shown in Table 1.6, which 
contains the computations for the OC curve of the example of Sec. 4.3. 

TABLE 1.6

Work Sheet for Operating-characteristic Curve

Basic quantities: N = 95; k = 0.7645; S = \sqrt{1/N + k^2/[2(N-1)]} = 0.11677

  (1)       (2)        (3)        (4)        (5)       (6)      (7)        (8)
  L_p     K_{L_p}    S × (2)    k - (3)    k + (3)      p        p'    1 - (1) = L_{p'}

 0.005    2.57583    0.30078    0.46372    1.06528    0.321    0.143      0.995
 0.01     2.32635    0.27165    0.49285    1.03615    0.311    0.150      0.99
 0.02     2.05375    0.23982    0.52468    1.00432    0.300    0.158      0.98
 0.03     1.88079    0.21962    0.54488    0.98412    0.293    0.163      0.97
 0.04     1.75069    0.20443    0.56007    0.96893    0.288    0.166      0.96
 0.05     1.64485    0.19207    0.57243    0.95657    0.284    0.169      0.95
 0.15     1.03643    0.12102    0.64348    0.88552    0.260    0.188      0.85
 0.25     0.67449    0.07876    0.68574    0.84326    0.246    0.200      0.75
 0.50     0          0          0.76450    0.76450    0.222    0.222      0.50


First, enter at the top of the sheet the three basic quantities N, k, and
S, S being defined by the left-hand side of equation (47).

Second, enter in column (1) various ordinates L_p less than or equal to
0.5 for which the corresponding abscissas p are to be computed.

Third, enter in column (2) values K_{L_p} of the normal deviates
corresponding with the values of L_p; these may be obtained from Table 1.1.

Fourth, enter in column (3) the products of S by the normal deviates
in column (2).

Fifth, subtract from k the entries in column (3) and enter the
differences in column (4).

Sixth, add to k the entries in column (3) and enter the sums in column
(5).

Seventh, enter in column (6) the probabilities corresponding with the
normal deviates in column (4); these may be obtained from Table 1.5.

Eighth, enter in column (7) the probabilities corresponding with the
normal deviates in column (5); these may be obtained from Table 1.5.

* The value 15.04 percent stated in Sec. 4.6 was obtained by an exact method, 
described in Sec. 11.5, which does not involve the mathematical approximations 
involved in formulas (42) and (43). 
Ninth, subtract from 1 the values of L_p shown in the first column and
enter the differences in column (8).

Tenth, plot one-half the OC curve, using as abscissas the values of p
shown in column (6) and as ordinates the corresponding values of L_p
shown in column (1).

Eleventh, plot the other half of the OC curve, using as abscissas the
values of p′ shown in column (7) and as ordinates the corresponding
values of L_p′ shown in column (8).

The OC curve corresponding with Table 1.6 is shown in Fig. 1.4. 
The three dots represent the central three points of Sec. 6.2, formulas (38) 
to (40). 
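
The eleven steps of the work sheet can be sketched in a few lines of Python. This is only an illustrative sketch: the function name oc_work_sheet is hypothetical, the value of S is taken as given (it is defined by equation (47), which is not reproduced here), and the standard library's NormalDist stands in for Tables 1.1 and 1.5.

```python
from statistics import NormalDist


def oc_work_sheet(k, S, ordinates):
    """Return one row per ordinate L_p, laid out like columns (1)-(8) of Table 1.6.

    k and S are the basic quantities entered at the top of the sheet;
    S is assumed to have been computed beforehand from equation (47).
    """
    z = NormalDist()  # standard normal, replaces Tables 1.1 and 1.5
    rows = []
    for L in ordinates:
        K = z.inv_cdf(1.0 - L)        # column (2): deviate exceeded with probability L_p
        sk = S * K                    # column (3)
        low, high = k - sk, k + sk    # columns (4) and (5)
        p = 1.0 - z.cdf(low)          # column (6): probability of exceeding column (4)
        p_prime = 1.0 - z.cdf(high)   # column (7): probability of exceeding column (5)
        rows.append((L, K, sk, low, high, p, p_prime, 1.0 - L))  # column (8) last
    return rows


if __name__ == "__main__":
    ordinates = [0.005, 0.01, 0.02, 0.03, 0.04, 0.05, 0.15, 0.25, 0.50]
    for row in oc_work_sheet(k=0.7645, S=0.11677, ordinates=ordinates):
        print("  ".join(f"{value:8.5f}" for value in row))
```

Plotting column (6) against column (1) and column (7) against column (8) then gives the two halves of the OC curve.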

7. TECHNIQUES OF APPLYING THE PLAN 
7.1 Introduction 

Without calculating either the mean x̄ or the standard deviation s,
the computing can be arranged to show whether the acceptance criterion

x̄ + ks < U (50)

has been met. This is done partly by rearranging formula (50), and partly 
by carrying out in advance certain steps in the computations that do not 
depend upon the data. 

There is some question as to how much the computational simplifica¬ 
tions add to the procedure. They are likely to be welcomed by those 
who consider the standard deviation “too complicated for practical pur¬ 
poses,” and who would therefore reject (50) as an acceptance criterion if 
it were not emphasized that the standard deviation need not actually be 
computed. Those holding this view, however, often have had little or no 
practical experience in computing or using standard deviations, and what 
experience they have had may have been with inefficient computing tech¬ 
niques. In fact, statisticians do not generally realize how often standard 
deviations are computed by the naive use of formula (5). The mean is 
calculated, then the N differences between it and the observations are 
found and recorded; these are then squared and added, the sum divided 
by N — 1, and the square root taken. Not only is this process tedious 
and liable to error, but there is an inherent loss of accuracy from round¬ 
ing the differences. The standard deviation should, of course, be com¬ 
puted from formula (6). With modern computing machines it can be 
calculated simultaneously with the calculation of the mean and with very 
little extra trouble. With the Monroe Model AA-1, for example, the first 
observation is set in the keyboard and the X and = keys pressed in rapid 
succession; then the next observation is set on the keyboard and the same 
two keys pressed again; and so on. When this has been completed for all 
N observations, which takes little more time and trouble than adding the 
data, the quantities Σx and Σx² needed for formula (6) are found in the
upper and lower dials, respectively.*
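
The one-pass accumulation described above can be sketched as follows. Formula (6) is not reproduced in this section, so the sum form used here, s² = [NΣx² − (Σx)²]/[N(N − 1)], is an assumption about what it states; the function names are hypothetical.

```python
import math


def running_sums(observations):
    """Accumulate N, sum of x, and sum of x squared in a single pass over the data."""
    n = 0
    sum_x = 0.0
    sum_x2 = 0.0
    for x in observations:
        n += 1
        sum_x += x
        sum_x2 += x * x
    return n, sum_x, sum_x2


def mean_and_sd_from_sums(n, sum_x, sum_x2):
    """Mean and standard deviation from the two sums (assumed form of formula (6))."""
    mean = sum_x / n
    variance = (n * sum_x2 - sum_x ** 2) / (n * (n - 1))
    return mean, math.sqrt(variance)


if __name__ == "__main__":
    # Sums quoted later in the chapter for a sample of 95 observations.
    print(mean_and_sd_from_sums(95, 2872, 89175))
```

On a digital computer the sum form can lose accuracy when the mean is large relative to the standard deviation; subtracting a convenient constant from every observation first avoids this without changing s.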

Another reason computational simplifications are perhaps less impor¬ 
tant than they may at first appear is that the actual values of the mean 
and standard deviation, although not needed for acceptance purposes, are 
valuable for other purposes, as, for example, process control. In fact, 
the conditions under which inspection is based on variables instead of 
attributes are likely to be those where the cost of obtaining each obser¬ 
vation is so high, because of destruction of test items, use of costly test 
equipment, and so forth, that the cost of analyzing the data is almost 
negligible in comparison. 


7.2. Computational Procedure†

7.2.1. Formulas.—For any given plan of action, determined by a fixed
N and k, the left-hand side of formula (50) depends only upon the sum
of the observations Σx and the sum of the squares of the observations
Σx². Let

X = \sum x    (51)

Y = \sum x^2    (52)


These two sums may be computed easily and simultaneously on any 
automatic computing machine. If only an adding machine is available, 
Y can be obtained with the aid of a table of squares, for example, Barlow's. 
If k is positive, it may be noted first that if 


X > NU (53) 

the lot may be rejected without regard to Y. In that case x exceeds U; 
since s must be positive, x + ks will still further exceed U. 

Second, it may be noted that if 


Y < \frac{U^2 (N - 1)}{k^2 + 1}    (54)


the lot may be accepted without regard to X. Equation (72) below is, 
however, more efficient than (54) in permitting immediate acceptance. 


* This paragraph indicates only the general nature of the procedures for computing 
the standard deviation. Further simplifications are usually possible. Also, the use 
of the Monroe Model AA-1 as an illustration does not imply that this make or model 
is superior to other automatic calculating machines. Both the Fridén and Marchant
companies make excellent machines. The Statistical Research Group attempted to 
evaluate various makes and models on the basis of about 660 machine-months of 
experience in its offices from 1942 to 1945. The evidence failed to show any one 
make or model to be best, although there were interesting differences. The terms 
“upper dials" and “lower dials" used in this chapter apply to Marchant and Monroe 
calculators but should be interchanged for Fridén calculators.

† For a discussion of the accuracy with which the measurements should be made,
see Chap. 4. 
Third, if neither inequality (53) nor (54) is satisfied, the lot is accepted
if

Y < aX^2 - bX + c    (55)

where

a = \frac{Nk^2 + N - 1}{N^2 k^2}    (56)

b = \frac{2U(N - 1)}{Nk^2}    (57)

c = \frac{U^2 (N - 1)}{k^2}    (58)


If inequality (55) is not fulfilled, the lot is rejected. The values of
a, b, and c given by (56), (57), and (58) do not depend upon the outcome
of the inspection but only upon the sampling plan defined by N and k 
and upon the definition of a defective item as given by U . When a 
sample has been inspected, the values of X and Y are obtained from the 
observations and substituted into (55). Formula (78) below provides an 
additional condition, linear in X , that may be inserted between (54) and 
(55), which are, respectively, independent of and quadratic in X ; formula 
(78), however, requires a preliminary estimate of the region in which the 
sample mean is likely to lie. 
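
The three-stage test of formulas (53), (54), and (55) can be written out as a short Python sketch. The function names plan_constants and decide are hypothetical, and k is assumed positive, as the text requires for (53).

```python
def plan_constants(N, k, U):
    """Constants a, b, c of formulas (56), (57), (58); they depend only on the plan."""
    a = (N * k ** 2 + N - 1) / (N ** 2 * k ** 2)
    b = 2 * U * (N - 1) / (N * k ** 2)
    c = U ** 2 * (N - 1) / k ** 2
    return a, b, c


def decide(N, k, U, X, Y):
    """Return 'accept' or 'reject' from X = sum of x and Y = sum of x squared."""
    if X > N * U:
        return "reject"                       # (53): the mean alone exceeds U
    if Y < U ** 2 * (N - 1) / (k ** 2 + 1):
        return "accept"                       # (54): acceptance regardless of X
    a, b, c = plan_constants(N, k, U)
    return "accept" if Y < a * X ** 2 - b * X + c else "reject"   # (55)


if __name__ == "__main__":
    # Plan and sample values used in the example of Sec. 7.2.2.
    print(plan_constants(95, 0.7645, 35))
    print(decide(95, 0.7645, 35, X=2872, Y=89175))
```

Because a, b, and c do not depend on the data, they would normally be computed once per plan and reused for every lot.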

7.2.2. Example.—Consider a sampling plan in which N = 95 and 
k = 0.7645. Suppose that for the characteristic being inspected values 
in excess of U = 35 are regarded as defective. 

From these values of N, k, and U we have, from (56), (57), and (58),

a = 0.0283471 (59)

b = 118.508 (60)

c = 197,019 (61)

Now suppose that a sample shows

X = Σx = 2,872 (62)

Y = Σx² = 89,175 (63)

First, it is noted that, since 

X < 3,325 (64) 

inequality (53) is not fulfilled, so immediate rejection is not called for. 
Second, it is noted that, since 

Y > 72,675 (65) 

immediate acceptance is not possible on the basis of (54). Applying (55), 
it is found that 

Y < 90,482 (66) 

Therefore, the lot is acceptable. 
In this example

x̄ = 30.232 (67)

s = 5.000 (68)

x̄ + ks = 34.05 (69)

and, since this is less than 35, acceptance is indicated.

7.3. Graphical Procedure 

7.3.1. Original Data.—The computing involved in substituting X and
X² into formula (55) may be avoided by the use of a chart. Values of X
may be plotted horizontally and values of Y vertically, and a curve
(parabola) drawn showing for each value of X the minimum value of Y,
Y_R, requiring rejection as determined from formula (55) using the equality
sign. Then as soon as X and Y are determined from a sample, a point 
may be plotted on the chart. If the point falls on or below the parabola, 
the lot is accepted; if the point falls above the parabola, the lot is rejected. 

Figure 1.5 shows such a curve for the example of Sec. 7.2.2, and the 
point corresponding with the sample of that example is shown. * Also 
shown in the figure is a lower boundary for the acceptance region. This
lower boundary arises from the fact that, for any given sum of the
observations X, it is algebraically impossible for the sum of the squares of
the observations Y to be less than a certain minimum; this minimum is the
value Y takes when all the observations are equal, namely,

Y_E = \frac{X^2}{N}    (70)

If a sample point should fall on the lower line, it would indicate that the 
standard deviation was zero. A sample point cannot fall below the 
lower line unless there is an error in the calculations. 

For practical purposes only a part of the range of X shown in Fig. 1.5 
would be needed, for example, that enclosed in the rectangle. Confining 
the chart to this region, as in Fig. 1.6, permits the use of a larger scale
and thus improves the accuracy of plotting. 

7.3.2. Adjusted Data. —The graphical procedure of Sec. 7.3.1 is 
impractical when the observations are large. In such cases the range of 
variation in Y as X varies is large relative to the width of the acceptance 
band for a fixed X ; when the Y scale is contracted sufficiently to go on a 
sheet of reasonable size, the upper and lower boundaries of the acceptance 
region are pulled so close together that the chart becomes useless. A 
remedy for this is to replace the sum of the squares of the observations Y 

* Formula (54) gives the minimum value of formula (55). Actually, it gives a
value slightly below the minimum, because the minimum, which occurs when
X = UN(N − 1)/(Nk² + N − 1), is NU²(N − 1)/(Nk² + N − 1). If Y is as small
as (54), it is impossible for X to be too large; this is illustrated in Fig. 1.5.


by the sum of the squares of adjusted observations Y′, where

Y' = \sum (x - A)^2 = Y - 2AX + NA^2    (71)

and A is an adjustment constant. The adjustment constant is, in effect,
subtracted from each observation before squaring; actually the sum of
squares of the adjusted observations, Y′, is obtained directly from X and
Y by formula (71) without adjusting each observation.

In order to make the chart most useful, the adjustment constant A
should, if possible, be chosen near the mean of the characteristic being
inspected. Usually there will be some evidence as to the general
neighborhood in which the mean lies. If there is no evidence at all, it will
often be suitable to choose A somewhere in the interval between U/(k² + 1)
and U.

To compute Y′, X and Y are obtained simultaneously as described
in Sec. 7.1, and X is recorded. Then X is copied to the keyboard and
multiplied negatively by 2A, which may be computed in advance. Next
NA², which also should be computed in advance, is added. These two
operations adjust the sum of squares, and Y′ may be copied from the
lower dials. An alternative computing procedure, which may be useful
if the same inspection plan, or at least the same adjustment constant, is
to be used repeatedly, is to prepare a table showing (x − A)² for the
various values of x that might be observed. From this table Y′ can be
found by simple addition.

The values of X and Y′ determine whether the acceptance criterion
has been met. If X > NU, immediate rejection may be made, according
to formula (53). Instead of (54) for immediate acceptance, we have

Y' < \frac{(U - A)^2 (N - 1)}{k^2 + 1}    (72)

If this is fulfilled the lot may be accepted immediately. Notice that Y′
may satisfy (72) even when Y fails to satisfy (54). Otherwise, a chart
similar to Fig. 1.6 may be drawn in which the upper boundary of the
acceptance region is given by

Y'_R = aX^2 - (b + 2A)X + (c + NA^2)    (73)

where a, b, and c are defined by (56), (57), and (58), and the lower
boundary is given by

Y'_E = \frac{X^2}{N} - 2AX + NA^2    (74)

Fig. 1.5.—Graphical procedure for determining from Σx and Σx² whether
x̄ + ks < U; N = 95, k = 0.7645, U = 35.

Note: The Y_E curve should extend indefinitely, separating the error and
rejection regions, but in a small figure this would make it difficult to
identify the termination point of the acceptance region.

Fig. 1.6.—Enlarged segment of computational chart in terms of original data
(enlargement of area indicated in Fig. 1.5).

Note: See note to Fig. 1.5.

Figure 1.7 shows such a chart for the example of Sec. 7.2.2, the
adjustment constant being taken as A = 30. Thus, the equations of the
upper and lower boundaries of the acceptance region are

Y'_R = 0.0283471X^2 - 178.508X + 282,519    (75)

Y'_E = 0.0105263X^2 - 60X + 85,500    (76)

Since 2A = 60 and NA² = 85,500, the value of Y′ in the sample is, by
formula (71), 2,355. This does not satisfy inequality (72), which would
require that Y′ be 1,483 or less; but when the point X = 2,872, Y′ = 2,355
is plotted in Fig. 1.7, it is seen to be within the acceptance region.

It is stated in Sec. 7.2.1 that (72) is more effective than (54) in
permitting immediate acceptance. In the present example, (72) requires
that Y′ < 1,483, which (since X = 2,872) is equivalent to requiring that
Y < 88,303, whereas (54) requires that Y < 72,675. Thus values of Y
between 72,675 and 88,303 will lead to immediate acceptance by (72)
but not by (54). From the point of view of permitting acceptance on
the basis of Y′, rather than the point of view of making the chart usable,
very nearly the most efficient form of (72) is obtained by setting

A = \frac{\mu'(k^2 + 1) - U}{k^2}    (77)

where μ′ is the best estimate of x̄ available before the sample is inspected.
In the present example, if μ′ is taken as 30, formula (77) shows A about
21.5. Taking A = 21, formula (72) becomes Y′ < 11,628, and since
the sample value of Y′ is 10,446, immediate acceptance is possible. The
value of A given by (77) may be made the basis of a condition for
acceptance in terms of Y; intermediate between (54) and (55), which are,
respectively, constant and quadratic in X, the linear condition for
acceptance may be inserted

Y < c' - b'X    (78)

where

b' = b - 2aN\mu'    (79)

c' = c - aN^2\mu'^2    (80)

and a, b, and c are defined by (56), (57), and (58). Taking μ′ = 30 in
the illustrative example, (78) becomes Y < 90,468, which leads to
acceptance. If the value of μ′ happens to coincide with x̄, (78) becomes
equivalent to (55).
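
The adjusted-data quantities of this section can be checked numerically with a short Python sketch. The function names are hypothetical, and the forms used for (72) and (77) are the ones that reproduce the figures quoted in the text (1,483, 11,628, and A about 21.5); they should be read as a reconstruction rather than as an authoritative statement of those formulas.

```python
def adjusted_sum_of_squares(N, X, Y, A):
    """Y' of formula (71): sum of squares about the adjustment constant A."""
    return Y - 2 * A * X + N * A ** 2


def immediate_acceptance_bound(N, k, U, A):
    """Right-hand side of (72); with A = 0 it reduces to (54)."""
    return (U - A) ** 2 * (N - 1) / (k ** 2 + 1)


def near_optimal_adjustment(k, U, mu_prime):
    """A of formula (77), given a prior estimate mu_prime of the sample mean."""
    return (mu_prime * (k ** 2 + 1) - U) / k ** 2


def linear_acceptance_bound(N, k, U, mu_prime, X):
    """Right-hand side of (78): c' - b'X, with b' and c' from (79) and (80)."""
    a = (N * k ** 2 + N - 1) / (N ** 2 * k ** 2)    # (56)
    b = 2 * U * (N - 1) / (N * k ** 2)              # (57)
    c = U ** 2 * (N - 1) / k ** 2                   # (58)
    b_prime = b - 2 * a * N * mu_prime              # (79)
    c_prime = c - a * N ** 2 * mu_prime ** 2        # (80)
    return c_prime - b_prime * X


if __name__ == "__main__":
    N, k, U, X, Y = 95, 0.7645, 35, 2872, 89175
    for A in (30, 21):
        print(A, adjusted_sum_of_squares(N, X, Y, A),
              immediate_acceptance_bound(N, k, U, A))
    print(near_optimal_adjustment(k, U, 30))
    print(linear_acceptance_bound(N, k, U, 30, X))
```

Geometrically, (78) is the tangent to the parabola (55) at X = Nμ′, which is why it coincides with (55) when μ′ equals the sample mean.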

8. REDUCED AND TIGHTENED INSPECTION 
8.1. Purpose of Reducing or Tightening Inspection 

It is emphasized in Secs. 3.4 and 6.1 that the OC curve shows the 
conditional probability that if a submitted lot has a given quality p 
it will be accepted. It does not show the probability that a lot of 
quality p will be accepted. This latter probability cannot be controlled 
by the sampling plan alone, for it depends upon the probability that a lot
of quality p will be submitted.

In judging the value of the protection afforded by a sampling plan, 
information about the quality of submitted material is important. The 
fact that a plan rejects 90 percent of submitted lots having a certain 
unacceptable quality may be of little importance if a great many of the 
submitted lots are of that quality; in that case many of the lots passed 
by the plan will be unacceptable, even though most of the submitted lots 
are rejected. The fact that the plan rejects 90 percent of submitted 
lots of an unacceptable quality p is also unimportant if submitted lots 
rarely have that quality, for then very little such material will be accepted, 
even though most submitted lots are passed. In the first case, when bad 
material is submitted very often, the protection given by the sampling 
plan may be inadequate and require tightening; but in the second case, 
when bad material is rarely submitted, the protection may be superfluous
and reduction may be permissible. * 

8.2. Criteria for Reducing or Tightening Inspection 

8.2.1. Process Average Percent Defective.—The principal criterion 
for determining whether reduced inspection is permissible or tightened 
inspection necessary is the process average percent defective, usually 
called simply the process average or PA, which is an estimate of the 
percent defective computed from a number of lots. 

This estimate is best stated as a confidence interval, that is, an interval 
within which it may be asserted with, say, 95 percent confidence that the 
true process average lies. If the estimate were stated as a single figure, it 
would invariably differ from the true process average because of sampling 
fluctuations; and the question would therefore arise as to whether the 
difference between the estimated process average and the acceptable 
quality level could reasonably be ascribed to sampling fluctuations, or 
whether on the other hand it indicated a real discrepancy between the true 
process average and the acceptable quality level. A confidence interval 
answers such questions. It is constructed from the sample data in such a 
way that 95 percent, or any other desired proportion y, of the samples will 
lead to confidence intervals that overlap the true process average. 

For the purpose of computing the process average, all data in the 
samples from a number h of consecutive lots are thrown together into one 
large sample. The mean x and the standard deviation s of this combined 
sample are computed from formulas (4) and (6). Alternatively, they 

* For a general discussion of the principles and methods of tightening and reducing 
inspection, see Statistical Research Group, Sampling Inspection . The discussion 
there relates to inspection by attributes, but, insofar as it runs in terms of the OC 
curve and the amount of inspection, it is also applicable to variables. 
may be computed from the means x̄_i, the standard deviations s_i, and the
sample sizes N_i of the individual samples by the following formulas:

\bar{x} = \frac{N_1 \bar{x}_1 + N_2 \bar{x}_2 + \cdots + N_h \bar{x}_h}{N_1 + N_2 + \cdots + N_h} = \frac{\sum N_i \bar{x}_i}{N}    (81)

s^2 = \frac{(N_1 - 1)s_1^2 + (N_2 - 1)s_2^2 + \cdots + (N_h - 1)s_h^2 + N_1(\bar{x}_1 - \bar{x})^2 + N_2(\bar{x}_2 - \bar{x})^2 + \cdots + N_h(\bar{x}_h - \bar{x})^2}{N_1 + N_2 + \cdots + N_h - 1}
    = \frac{\sum (N_i - 1)s_i^2 + \sum N_i \bar{x}_i^2 - (\sum N_i \bar{x}_i)^2 / N}{N - 1}    (82)

where

N = \sum N_i    (83)
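
Formulas (81) to (83) can be checked against a direct computation on the combined data. The sketch below is illustrative only; the function name pool and the sample figures are hypothetical.

```python
import math
import statistics


def pool(lot_summaries):
    """Combine per-lot (N_i, mean_i, s_i) triples by formulas (81), (82), and (83)."""
    N = sum(n for n, _, _ in lot_summaries)                      # (83)
    mean = sum(n * m for n, m, _ in lot_summaries) / N           # (81)
    within = sum((n - 1) * s ** 2 for n, _, s in lot_summaries)
    between = sum(n * (m - mean) ** 2 for n, m, _ in lot_summaries)
    s = math.sqrt((within + between) / (N - 1))                  # (82), first form
    return N, mean, s


if __name__ == "__main__":
    # Hypothetical observations from three consecutive lots.
    lots = [[29.1, 31.4, 30.2, 33.0], [27.5, 30.9, 28.8], [32.2, 30.1, 29.7, 31.5, 30.0]]
    summaries = [(len(lot), statistics.mean(lot), statistics.stdev(lot)) for lot in lots]
    combined = [x for lot in lots for x in lot]
    print(pool(summaries))
    print(len(combined), statistics.mean(combined), statistics.stdev(combined))
```

The two printed lines should agree apart from rounding, since (81) and (82) are algebraically the mean and standard deviation of the combined sample.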


From s and x̄ we compute the observed difference between U
and the sample mean in units of the sample standard deviation

k_0 = \frac{U - \bar{x}}{s}    (84)

Then, given a confidence coefficient γ, a confidence interval which in a
fraction γ of samples will overlap the true fraction defective may be
obtained from formulas (42) and (43) by substituting k₀ for k and (1 − γ)/2
for L_p. The p and p′ given by (42) and (43) are the upper and lower
confidence limits p̄_γ and p̲_γ, respectively, for the fraction defective
produced by the process. The larger the confidence coefficient, the broader
the interval; and the smaller the confidence coefficient, the narrower the
interval. In fact, if γ is 1, the interval runs from 0 to 100 percent;
and if γ is 0, the interval includes only the point p = p̂, where p̂ is the
probability that a normal deviate will exceed k₀.

If the acceptable quality level lies above the confidence interval, 
reduced inspection is permissible. * If it lies below the confidence interval, 
tightened inspection is necessary. If it lies within the confidence 
interval, normal inspection may be continued. 

In order to avoid too frequent shifts in the degree of inspection, it is 
advisable that the criteria for resuming normal inspection when tightened 
or reduced inspection is in effect should differ from the criteria for remain¬ 
ing on normal inspection when it is already in effect. One way to achieve 
this is to use a larger confidence coefficient when deciding whether to
continue normal inspection than when deciding whether to resume normal
inspection. Reduced inspection, for example, will be installed as
soon as the process average falls to a certain level, but once it has been
installed normal inspection will not be resumed unless this level is exceeded
by a certain margin. It is therefore possible, once either reduced or
tightened inspection has been installed, to continue it under conditions
that would not justify its introduction were normal inspection in effect.

* For reduced inspection, it is customary to impose a further requirement to
restrict the variability of the process average from lot to lot. This usually takes the
form of requiring that each of the last twenty (or some other number) lots have been
acceptable.

8.2.2. Illustrative Computation of Process Average.—Suppose that
the combined sample from several inspection lots contains 95 observations,
that their sum is 2,872, that the sum of their squares is 89,175,
and that values above 35 represent defects. Suppose a confidence
coefficient γ_c of 0.99 is used for judging whether to continue normal
inspection and a confidence coefficient γ_r of 0.95 for judging whether to
resume normal inspection. Thus

N = 95    Σx = 2,872    Σx² = 89,175    (85)

U = 35    γ_c = 0.99    γ_r = 0.95    (86)

Then, from formulas (4) and (6)

x̄ = 30.2316    s = 5.000    (87)

so

k_0 = \frac{35 - 30.2316}{5} = 0.9537    (88)


Entering Table 1.5 with K_ε = 0.9537, we find (by linear interpolation)
17.01 percent as the point estimate p̂, corresponding with a confidence
coefficient of zero, of the process average percent defective.

To find the 99 percent confidence limits, we substitute k = 0.9537,
L_p = (1 − 0.99)/2 = 0.005 into (42) and (43). The details of a similar
computation are shown in Sec. 6.3.1. The 99 percent confidence limits
are

p̲_0.99 = 10.15 percent and p̄_0.99 = 26.29 percent (89)

Similarly, taking L_p = 0.025, the 95 percent confidence limits are

p̲_0.95 = 11.57 percent and p̄_0.95 = 23.86 percent (90)

If normal inspection is in effect, it will be continued if the AQL for the 
product is between 10.15 and 26.29 percent defective; reduced inspection 
will be installed if the AQL exceeds 26.29 percent and tightened inspection 
will be installed if the AQL is below 10.15 percent. If tightened or 
reduced inspection is in effect, normal inspection will be resumed if 
the AQL is between 11.57 and 23.86 percent. If the difference between 
the intervals for continuation and for resumption is too small, that is, if 
too many shifts occur, it can be widened by increasing the difference 
between the two confidence coefficients used. 
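
The computation and the resulting decision can be sketched in Python. Formulas (42), (43), and (47) are not reproduced in this section, so the sketch assumes the work-sheet form of Table 1.6, namely normal deviates k₀ ∓ S·K with S = √[1/N + k₀²/(2(N − 1))]; this form of S is an inference that happens to reproduce S = 0.11677 for N = 95 and k = 0.7645, and should be checked against (47). The function names are hypothetical.

```python
import math
from statistics import NormalDist


def confidence_limits(N, k0, gamma):
    """Lower and upper confidence limits for the fraction defective (0 <= gamma < 1).

    Assumes S = sqrt(1/N + k0^2 / (2(N - 1))), an inferred form of equation (47).
    """
    z = NormalDist()
    K = z.inv_cdf(1.0 - (1.0 - gamma) / 2.0)   # deviate for L_p = (1 - gamma)/2
    S = math.sqrt(1.0 / N + k0 ** 2 / (2.0 * (N - 1)))
    lower = 1.0 - z.cdf(k0 + S * K)
    upper = 1.0 - z.cdf(k0 - S * K)
    return lower, upper


def inspection_level(aql, lower, upper):
    """Rule of Sec. 8.2.1 for an AQL expressed as a fraction defective."""
    if aql > upper:
        return "reduced"
    if aql < lower:
        return "tightened"
    return "normal"


if __name__ == "__main__":
    for gamma in (0.0, 0.95, 0.99):
        print(gamma, confidence_limits(95, 0.9537, gamma))
    print(inspection_level(0.15, *confidence_limits(95, 0.9537, 0.99)))
```

With γ = 0 both limits collapse to the point estimate, in agreement with the remark in Sec. 8.2.1.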
For routine operations in which the same AQL and the same value of
U are to be used repeatedly, it is desirable to prepare a table showing for
various sample sizes the values of k₀ that result in continuing normal
inspection if it is in use and those that result in resuming normal
inspection if tightened or reduced inspection is in use. In the present
illustration, for example, if the AQL is 15 percent, a value of k₀ computed
from 95 observations leads under normal inspection to continuing normal
inspection if it lies between k̲_0.99 = 0.7380 and k̄_0.99 = 1.4107, or leads
under reduced or tightened inspection to resuming normal inspection if it
lies between k̲_0.95 = 0.8048 and k̄_0.95 = 1.3113. These values are
determined from formula (21), using both the plus and the minus signs to
obtain upper and lower limits and taking p₁ = AQL, α = (1 − γ)/2. An
example of the computation of (21) is given in Sec. 4.5.2.

8.3. Methods of Reducing or Tightening Inspection 

Reduced inspection may be achieved by reducing N, perhaps as
much as 40 or 50 percent, and adjusting k so that the producer’s risk 
will not be increased. This leaves the OC curve approximately unchanged 
for qualities equal to or better than the AQL but increases the risk that 
lots of worse quality will, if submitted, be accepted. 

Tightened inspection may be achieved by increasing k but leaving N 
fixed. This moves the OC curve to the left, thus reducing the consumer’s 
risk but increasing the producer’s risk. 

9. CHOICE OF QUALITY STANDARDS AND EVALUATION OF PROPOSED 

SAMPLING PLANS 

9.1. General Discussion 

The definition of a defective item, the choice of an AQL and an 
LTPD,* and the selection of producer’s and consumer’s risks usually 
require a compromise between what is technically or economically desir¬ 
able in the use for which the product is intended and what is technically or 
economically feasible in its production. Although no fixed rules can be 
established for selecting U, p₁, p₂, α, and β, a few guides are worth noting.

* LTPD, or lot tolerance percent defective, is defined sometimes in terms of the
use for which the product is intended, as the worst quality for which there should be 
any appreciable probability of a lot's being accepted, and sometimes, in terms of the 
sampling plan, as that quality for which the probability of acceptance has a specified 
value (customarily 10 percent). Similarly, the AQL, or acceptable quality level, is 
defined sometimes as the worst quality that can be consistently accepted or as the 
best quality for which there should be any appreciable probability of a lot's being 
rejected, and sometimes, as that quality for which a certain sampling plan gives a 
specified probability of rejection (customarily 5 percent). We follow the definitions
in terms of sampling plans, using the customary risks of 5 and 10 percent for the 
AQL and LTPD, respectively. 
Some of these five quantities can be selected arbitrarily or by convention
without restricting the range of available plans. For example,
it is more or less conventional to take the producer’s risk α as 5 percent
and the consumer’s risk β as 10 percent. Then the AQL, or p₁, may be
defined as that quality for which the probability of acceptance should be
95 percent and the LTPD, or p₂, as that quality for which the probability
of acceptance should be 10 percent. Sometimes α and β are set tentatively
and values of U, p₁, and p₂ selected; then the values of α and β may
be reexamined in the light of the number of lots of qualities p₁ and p₂
likely to be submitted and the costs of rejecting those of quality p₁ or
accepting those of quality p₂.

To select a sampling plan, two independent analyses should be made 
and their results merged. One study should consider the uses to which 
the product is to be put. What is required of each item, how rigid are 
these requirements, and what is the effect of a certain percentage of the 
items failing to meet the requirements? The other study should consider 
the production process. Within what limits do most of the items fall, 
how stable are these limits, what would be the costs of improving them, 
and what is the cost of having a lot rejected? In designing a sampling 
plan on the basis of these two studies, it is necessary to take into account 
the cost of inspection. What is the cost of drawing a sample, what is 
the cost of inspecting the items, what is the cost of analyzing the data, 
and how do all these vary with the size of the sample? 

9.2. Analysis of Production Possibilities 

It should be assumed that the producer will consider the OC curve as 
in effect the specification of what will be accepted, and that he will adjust 
his operations to an OC curve as best suits his own circumstances. For 
example, if the producer finds that a rejected lot entails considerable 
costs, he will consider that the given OC curve actually forces him to 
produce at a quality much better than the AQL, how much better depend¬ 
ing upon how sharply the cost of production rises and the probability of a 
rejection declines as quality improves. If improvements in quality are 
also costly, a given OC curve may mean that the consumer will receive— 
and, of course, ultimately pay for—better quality than he needs. Under 
such circumstances the consumer will find it advantageous to increase the 
amount of inspection, for which also he ultimately pays; this will steepen 
the OC curve, that is, increase the rate at which the probability of a 
rejection declines as quality improves, and therefore reduce the degree to 
which the producer is forced to improve the quality and raise the cost.* 

* Some discussion of this point is given in J. Przyborowski and H. Wileński,
“Testing Clover Seed,” especially Sec. III(6), pp. 281-285.

There is obviously a relation between U on the one hand and p₁
and p₂ on the other. If U is set very high, only very bad items are
considered defective; only a small percentage of such items can be
accepted, so p₁ should be small. If U is set very low, items are considered
defective when they deviate very little from normal; the effect of such
items is less serious, so a larger percentage of them can be accepted.

If we regard U as fixed, we may, by analyzing data on the quality
characteristic that is to be inspected, estimate what proportion defective
is being produced. Section 8.2 gives a method for finding confidence
intervals p̲_γ to p̄_γ which, in a proportion γ of the samples from which
such confidence intervals are computed, overlap the true fraction
defective; or, if the confidence coefficient is reduced to zero, we may
obtain a single-valued estimate p̂. The height of the OC curve over the
interval p̲_γ to p̄_γ then indicates approximately how much of the current
production would be passed by a proposed sampling plan.

If, on the other hand, we regard p₁ as fixed, we may estimate the
value of U necessary to give confidence γ that the fraction defective will
not exceed p₁. This value is

\hat{U}_\gamma = \bar{x} + k_\gamma s    (91)

where k_γ is given by formula (21), using the plus sign and taking p₁ as
the fixed fraction defective and α as 1 − γ. Thus, in the example of
Sec. 8.2.2, where p₁ = 0.15, x̄ = 30.232, and s = 5.000, k_0.95 = 1.2632
and k_0.99 = 1.3695; so we may be 95 percent confident that under the
conditions prevailing when the sample was collected at least 85 percent
of the items lie below Û_0.95 = 36.54, or 99 percent confident that at least
85 percent lie below Û_0.99 = 37.08. Û is called a “tolerance limit.”*
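
A numerical sketch of the tolerance-limit computation follows. Formula (21) is not reproduced in this section; the expression used below is an assumed form, obtained by solving k ∓ K_α·√[1/N + k²/(2(N − 1))] = K_p for k, and is adopted only because it reproduces the values of k quoted in the text for N = 95 and p₁ = 0.15. The function names are hypothetical.

```python
import math
from statistics import NormalDist


def k_limit(p1, alpha, N, sign=+1):
    """Assumed form of formula (21); sign=+1 gives the upper value, -1 the lower."""
    z = NormalDist()
    K_p = z.inv_cdf(1.0 - p1)        # deviate exceeded with probability p1
    K_a = z.inv_cdf(1.0 - alpha)     # deviate exceeded with probability alpha
    d = 1.0 - K_a ** 2 / (2.0 * (N - 1))
    root = math.sqrt(d / N + K_p ** 2 / (2.0 * (N - 1)))
    return (K_p + sign * K_a * root) / d


def tolerance_limit(mean, s, p1, gamma, N):
    """One-sided limit of formula (91): mean + k * s, with alpha = 1 - gamma."""
    return mean + k_limit(p1, 1.0 - gamma, N) * s


if __name__ == "__main__":
    for gamma in (0.95, 0.99):
        print(gamma, k_limit(0.15, 1.0 - gamma, 95),
              tolerance_limit(30.2316, 5.000, 0.15, gamma, 95))
```

The same function with α = (1 − γ)/2 and both signs gives the pairs of k₀ limits quoted at the end of Sec. 8.2.2.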

9.3. Analysis of Consumption Requirements 

In analyzing the consumption, or design, requirements of a product, 
account should be taken of the variability of other products or methods 
with which it will be associated in use. A discussion of this is beyond the 
scope of this chapter,† but one especially simple example may be mentioned.

* See Chap. 2 for a discussion of tolerance limits. Chapter 2, however, is
concerned with two-sided tolerance limits, that is, values of k such that at least a
given proportion P of the population is included between the limits x̄ − ks and x̄ + ks.

An excellent discussion of tolerance limits, which in fact is the original source
from which subsequent theoretical work has developed, is given in Shewhart, Statistical
Method, Chap. II, pp. 50-79, How Establish Limits of Variability?

† See Shewhart, Economic Control of Quality, Part V, pp. 247-272, Statistical
Basis for Specification for Standard Quality, especially Chap. XVII, pp. 249-261,
Design Limits on Variability, in which Sec. 5, pp. 252-256, considers Tolerances for
Quality of Finished Product in Terms of Tolerances of Piece-parts, and Sec. 6, pp.
256-259, discusses The General Problem of Setting Tolerances on Controlled Product.
Suppose the items from a lot are to be assembled at random in sets of
n, and a set of n will be defective if the sum of the lengths of the n
pieces exceeds $nU'$, that is, if the mean length in the set exceeds $U'$.
If we consider lot quality measured by the percent of means of n that
exceed $U'$ and select two points on the desired OC curve, values of N and
k, say $N'$ and $k'$, can be obtained from (10) and (11) or (12). If now
inspection is carried out on the basis of individual items, rather than
sets of n, the lot can be accepted if in a sample of $nN'$ items

$$\bar{x} + \frac{k'}{\sqrt{n}}\, s < U' \qquad (92)$$

where $\bar{x}$ and s are the mean and standard deviation computed from the
$nN'$ items.

If in this example a piece were considered defective if it exceeded
one-nth the allowable sum for n pieces, that is, if it exceeded $U'$, the
sampling plan would require a sample of $N'$ items and the acceptance
requirement would be

$$\bar{x} + k's < U' \qquad (93)$$

This would necessitate production at a higher quality than would (92).
For example, if n = 4 and k' = 0.7645, the indifference quality, that is,
the quality having an equal chance of acceptance or rejection, is 35.12
percent defective for (92) and 22.23 percent for (93). If n = 9 and
k' = 2.0077 (the k given in Table 1.2 for $p_1 = 0.015$, $p_2 = 0.03$), the
indifference quality is 25.17 percent under (92) and 2.23 percent under
(93).
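
The indifference qualities just quoted can be checked with a short sketch. It assumes a normal distribution within lots and takes the indifference point to be where the lot mean plus the multiplier times the lot standard deviation equals U'; the function name is illustrative only, and the standard library's normal distribution stands in for Table 1.5.

```python
from math import sqrt
from statistics import NormalDist


def indifference_quality(k: float, n: int = 1) -> float:
    """Fraction of single items above U' in a normal lot that the criterion
    x-bar + (k / sqrt(n)) * s < U' accepts with probability one-half."""
    return 1.0 - NormalDist().cdf(k / sqrt(n))


for n, k in [(4, 0.7645), (9, 2.0077)]:
    by_sets = 100 * indifference_quality(k, n)   # criterion (92)
    by_items = 100 * indifference_quality(k)     # criterion (93)
    print(f"n = {n}, k = {k}: (92) {by_sets:.2f} percent, (93) {by_items:.2f} percent")
```

The results agree with the figures in the text to within about 0.01 percentage point, the small difference being attributable to interpolation in a printed table.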

As a digression, it may be observed that averages in sets of n may be
appropriate measures of quality when inspection is undertaken to determine
whether full measure is delivered. Suppose tomato juice is packed
in one-gallon cans* that the final purchasers buy one at a time. Then
any purchaser whose can contains less than one gallon is cheated, whether 
or not some other purchaser receives an equivalent or even a greater 
excess. Even though the purchaser would not be cheated in the long run 
if his purchase were repeated frequently, an appreciable shortage in any 
one can may inconvenience or annoy him—in fact, may prevent his 
repeating the purchase. In such circumstances, the appropriate measure
of lot quality is the percent of cans containing less than a stated
amount, say one gallon, or perhaps 0.99 gallon if a 1 percent deviation is 
permissible. If, however, the tomato juice is to be sold to restaurants in 
cases of 12 cans, the appropriate measure of lot quality is the percent of 
groups of 12 that average less than one gallon per can. If the tomato 
juice is to be sold to Army camps in carloads, the percent of carloads
averaging less than one gallon per can is the appropriate measure of lot
quality; in such a case, however, one carload might well be an inspection
lot, so that the lot mean would become the appropriate quality measure.

* The example is taken from Peach, Industrial Statistics, p. 138. Peach,
however, treats the example differently.

9.4. Normality Assumption 

If within lots the distribution of items is not approximately normal, the 
sampling plan usually controls the percent defective as consistently as if 
the distribution were normal, but not at the level intended. 

What the plan actually controls is the quantity defined by equation
(1), namely,

$$K = \frac{U - \mu}{\sigma} \qquad (94)$$

Thus a lot is rejected if the sample value of K, say $k_0$, is too small
and accepted if it is sufficiently large. The OC curve might be plotted
with the horizontal axis representing the true value of K in submitted
lots; such an OC curve would be essentially correct regardless of the
normality of the distribution within lots. The assumption of normality
becomes important when translating values of K into values of p, the
fraction defective.

If the distribution is not normal but its form is stable from lot to lot
and is such that the percent defective is determined by K, then it is
possible to substitute for Table 1.5 another table more appropriate for
translating K into percent defective. In any event, it may be assumed that
the fraction defective corresponding with any K does not exceed $(1/K)^2$;
and if the distribution is unimodal and has its mode near its mean it may
be assumed that for any K the fraction defective does not exceed
$[2/(3K)]^2$.*
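
A brief sketch may help in comparing these two distribution-free bounds with the fraction defective that normal theory assigns to the same K. The function names are illustrative; the normal value comes from the standard library rather than from Table 1.5, and the bounds are informative only when K is large enough for them to fall below 1.

```python
from statistics import NormalDist


def chebyshev_bound(K: float) -> float:
    """Upper bound (1/K)^2 on the fraction defective, any distribution."""
    return (1.0 / K) ** 2


def camp_meidell_bound(K: float) -> float:
    """Upper bound [2/(3K)]^2 for a unimodal distribution, mode near mean."""
    return (2.0 / (3.0 * K)) ** 2


def normal_fraction(K: float) -> float:
    """Fraction above U when the lot is normal and (U - mu)/sigma = K."""
    return 1.0 - NormalDist().cdf(K)


for K in (1.5, 2.0, 3.0):
    print(K, round(chebyshev_bound(K), 4), round(camp_meidell_bound(K), 4),
          round(normal_fraction(K), 4))
```

For every K the normal fraction lies well below both bounds, which illustrates how conservative the bounds are when the distribution is in fact close to normal.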

The effects of non-normality are excellently stated by Jennett and
Welch:†

The above theory is developed on the assumption that the parent population
sampled is Gaussian. Even if it is not, the reasoning relating the
distribution of $k_0$ with K will not be invalidated unless the samples are
very small, because non-normality will not have much influence on $\bar{x}$
and s. The chief error lies in connecting K with p by using the normal
tables. This can be obviated if some other constant distributional form can
be assumed for which the relation between the standardized deviate K and p
is known. If we could say that the distribution would maintain the same
shape . . . , although with possible changes in the mean and standard
deviation, the above theory would need little modification.

* These quantities are based on the Tchebycheff and Camp-Meidell
inequalities, respectively (see Shewhart, Economic Control of Quality, pp.
176-177); in the present context they can be halved if symmetry can be
assumed, and reduced further if positive skewness is assumed.

† “The Control of Proportion Defective.” The passage quoted is from Sec. 3,
p. 88, Effect of Non-normality. The notation in the passage as quoted has
been altered to agree with that used in this chapter.

It would be misleading, however, to lay too much emphasis on the necessity
for making the connection between K and p correctly. Usually it is not possible 
to say with absolute certainty that a particular value of p is satisfactory and 
another, differing little from it, is unsatisfactory. Indeed, often it may not be 
clear that p has any intrinsic importance at all, but it may still be useful for 
purposes of control. For the same reason K may be regarded as a useful feature 
of the distribution to aim at controlling, without inquiring too closely into the 
exact meaning of K. Non-normality is therefore not such a serious matter as it 
may appear. However, no final word can be said about this without reference 
to the context in which the method is to be used. 

10. SIMULTANEOUS USE OF SEVERAL SAMPLING PLANS 
10.1. Inspection of Several Quality Characteristics

Often the quality of an item depends upon several variables. There 
may be measurements of physical dimensions, for example, those which 
determine whether the item will fit properly in a final assembly, or 
whether full measure has been delivered; measurements of the properties 
of the item, for example, hardness, chemical composition, tensile strength, 
or modulus of elasticity; measurements of performance, for example, 
duration of useful life or rate of output; and so forth. The lot quality
then is measured by n fractions defective, say $p_1, p_2, \ldots, p_n$.

Suppose that n criteria of the form

$$\bar{x} + ks < U \qquad (95)$$

are applied to a single sample of N. The probability that the lot will
pass all n tests cannot be read from the OC curve for any one test. For
a given set of values $p_1, p_2, \ldots, p_n$, the probability that a lot
will be accepted cannot be more than the smallest among the n values of
$L_p$; it will be this large if, and only if, the n tests are perfectly
correlated, so that a lot passing any one test is certain to pass all. At
the other extreme, the probability of acceptance may be zero; it will be
this small if a lot that passes one test is certain to fail some other, or
if $L_p$ is zero for any of the n values of p.*

* It might be thought that these two extreme cases would never be encountered in 
practice. Specification writing and acceptance test designing being what they are, 
this is not the case. The author cherishes the memory of one particular specification 
for a cylindrical tube. It specified length, inside diameter, outside
diameter, concentricity, weight, density, absence of voids, and all the
details of chemical composition, physical properties, and manufacturing
processes that determine density. Such a specification must be, at least in
some degree, either redundant or inconsistent; this
particular one perhaps had been redundant originally, but through an encrustation of 
changes made in one detail at a time it had become inconsistent. It would be naive 
to assume, however, that because no material could meet the acceptance criteria none 
was accepted. On the contrary, all of it was accepted. This was achieved partly by 
the buyer’s reasoning that the product was needed, that the acceptance tests failed 
to pass usable material, and that therefore material failing to pass the tests was usable; 
Between these two extremes of perfect positive correlation and
perfect negative correlation among the tests is the case in which the tests
are independent. In this case, the probability of accepting a lot of
quality $p_1, p_2, \ldots, p_n$ is given by the product of the individual
probabilities of acceptance:

$$L_{p_1, p_2, \ldots, p_n} = L_{p_1} \cdot L_{p_2} \cdots L_{p_n} \qquad (96)$$

Since each $L_{p_i}$ is 1 or less, this product cannot exceed the least
$L_{p_i}$ among the set of n. If n = 3 and each $L_{p_i} = 0.95$, the
product is $(0.95)^3$ or 0.857; if each $L_{p_i} = 0.99$, the product is
$(0.99)^3$ or 0.970. For n = 10 the corresponding products are 0.599 and
0.904. (When $L_{p_i}$ is near 1 and n is not too large, the probability
of rejecting is about n times as large for n independent tests as for one.)
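
The products quoted above, and the rule of thumb in parentheses, can be reproduced with a few lines. This is only an arithmetic sketch of formula (96) for independent tests; the function name is illustrative.

```python
from math import prod


def joint_acceptance(probabilities):
    """Probability of passing every test when the tests are independent,
    that is, the product of the individual acceptance probabilities (96)."""
    return prod(probabilities)


for n in (3, 10):
    for L in (0.95, 0.99):
        joint = joint_acceptance([L] * n)
        exact_reject = 1.0 - joint
        rough_reject = n * (1.0 - L)      # "about n times as large"
        print(n, L, round(joint, 3), round(exact_reject, 3), round(rough_reject, 3))
```

The last two columns show how the approximation "n times the single-test rejection probability" deteriorates as n grows or as the individual probability moves away from 1.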

Sometimes the situation is that for good quality the various tests are
independent but that for poor quality the tests are positively correlated.
Thus, if for each of five characteristics a lot has fractions defective that
give it a 95 percent chance of acceptance on the individual tests, its
chance of passing all five tests is about $(0.95)^5$ or 0.774. But if its chance
of acceptance on each individual test is 0.10, its chance of acceptance on 
all five may be nearly 0.10. In such situations, not only is the OC curve 
for the battery of tests lower than that for any one test, but it gives less 
sharp discrimination between good and bad lots. On the other hand, if 
the tests are independent or positively correlated for good quality but 
negatively correlated for bad quality, the discrimination is sharpened. 

Another aspect of the simultaneous use of several criteria like (95) is 
that this treats the various tests on an attribute basis, so to speak. If a 
lot just barely fails to pass one test but appears to be of exceptionally high 
quality on all other tests, it is rejected; whereas a second lot that just 
barely passes each test is accepted. It would be preferable to combine 
all the measurements on one item into a single score for that item and 
judge the lot by the scores of the sample items. * 

All these uncertainties about the simultaneous operation of several
criteria make the selection, operation, and interpretation of the plan much
less clear-cut and straightforward than when only one criterion is
concerned. They underscore the necessity, if variables are to be used, of a
careful preliminary analysis of a rather large body of data on the product
to which the plans are to be applied.

and partly by the seller's disclaiming responsibility for the quality of the product,
reasoning that he could take responsibility either for the quality of the product or for
following detailed instructions for making it, but not for the consistency of those
instructions with the desired quality.

* Chapter 3 deals with this problem. The methods presented there are new and
have not yet been used extensively or developed to the point where they are
comparable with those of this chapter in simplicity of selecting plans and
constructing OC curves, or in ease of routine operation and interpretation.
They will nevertheless well repay careful study, especially when complicated
and expensive products or experiments are concerned.

10.2. Upper and Lower Bounds on a Single Quality Characteristic 

10.2.1. General Considerations. The OC curve for two simultaneous
criteria of the form (95) can be determined mathematically in the case
where the two criteria simply impose upper and lower limits on a single
characteristic; that is, the case where an item is defective if it is either
above some upper limit U or below some minimum M. Some mathematical
investigations of this have been made,* but no completely satisfactory
solution has yet been obtained. The indications are, however,
that for practical purposes it will ordinarily suffice to use two criteria of
the form (95) determined independently for the upper and lower limits.
Two points about this procedure, however, should be recognized.

The first point is that the combinations of $\bar{x}$ and s that lead to
acceptance are not quite the same when two one-sided criteria are used as
they would be if a single two-sided test were constructed. The combinations
that lead to acceptance with two one-sided tests (having the same values
of N and k) are included in an area shaped like the triangle of Fig. 1.8,
whereas a single two-sided test would result in rounding off the top corner
of this triangle, somewhat as in the curve of Fig. 1.8. If most of the
production should fall in the small area not common to both regions, this
difference would be of importance. This would be a situation in which
the mean behaved excellently, consistently lying midway between M
and U, but the standard deviation was continually near the maximum
value permitting acceptance with the best value of the mean. If such a 
situation were to prevail consistently, acceptance inspection would 
probably be based on the standard deviation alone. 

The second point to be kept in mind in using two one-sided tests 
for a given characteristic is the effect on the OC curve. The true OC 
curve for a two-sided test cannot be expressed in terms of the lot fraction 
defective alone; it depends upon the lot mean and standard deviation. A
sample that fails to meet one criterion because of a large standard
deviation is likely to fail the other criterion also; but a sample that
fails one criterion because of an extreme mean is likely to pass the other
criterion.

Thus, although the use of two one-sided criteria to control both upward 
and downward variation often works out satisfactorily in practice, it 
should be recognized as an improvisation for a problem that is in essential 
respects different from the one-sided problem and is less amenable to 
simple methods and interpretations. 

* The mathematical work was done in the Statistical Research Group by A. Wald 
and J. Wolfowitz but has not been published. 
10.2.2. Use of a Pair of One-sided Criteria. Suppose two tests are
constructed so that with one the probability is $\beta$ that a lot with
fraction p (p < 0.5) above U will be accepted and with the other the
probability is $\beta$ that a lot with fraction p below M will be accepted.
If the standard deviation of the measurements is small, the probability is
practically $\beta$ that a lot with fraction p outside the limits M and U
will be accepted; but if the standard deviation is large, the probability
is practically $\beta$ that a lot with fraction 2p outside the limits M and
U will be accepted. If p/2 is substituted for p, the test will give the
intended probability of acceptance for high values of the standard
deviation but not for low values of the standard deviation.

If the standard deviation is expected always to be low (it need not be
so low when the relevant values of p are small as when they are large),
or if lots with high standard deviations are sufficiently infrequent as to
have only negligible effects on the operation of the tests, a pair of
one-sided tests may be used successfully. The two acceptance criteria, both
of which must be satisfied before a lot is accepted, are then

$$\bar{x} + ks < U \quad \text{and} \quad \bar{x} - ks > M \qquad (97)$$

It may be noted that if $\bar{x} > (M + U)/2$, the second inequality can be
ignored, for it will surely be met if the first is; similarly, if
$\bar{x} < (M + U)/2$, the first inequality can be ignored.
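
As an illustration of the pair of criteria (97), the following sketch applies both inequalities to a sample and also shows the shortcut of testing only the inequality on the side of the midpoint where the mean falls. The function names and the sample values are hypothetical; `statistics.stdev` uses the divisor N - 1.

```python
from statistics import mean, stdev


def accept_pair(sample, k, M, U):
    """Apply both one-sided criteria of (97) to one sample."""
    xbar, s = mean(sample), stdev(sample)
    return (xbar + k * s < U) and (xbar - k * s > M)


def accept_pair_shortcut(sample, k, M, U):
    """Test only the binding inequality: the upper one when the mean is at
    or above the midpoint of M and U, the lower one otherwise."""
    xbar, s = mean(sample), stdev(sample)
    if xbar >= (M + U) / 2:
        return xbar + k * s < U
    return xbar - k * s > M


# Hypothetical measurements, limits M = 25 and U = 35.
sample = [28, 30, 31, 29, 32, 30, 27, 33]
for k in (0.7645, 3.0):
    print(k, accept_pair(sample, k, 25, 35), accept_pair_shortcut(sample, k, 25, 35))
```

The two functions always agree, since the inequality that is skipped is implied by the one that is tested.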

As in the case of a single criterion, we may note that if

$$X > NU \quad \text{or} \quad X < NM \qquad (98)$$

the lot may be rejected without regard to Y, where X and Y are the sum of
the observations and the sum of the squares of the observations,
respectively, as defined by formulas (51) and (52).

Second, we may note that if

$$Y < \frac{U^2(N - 1)}{k^2 + 1} \quad \text{and} \quad X > N\,\frac{M + U}{2} \qquad (99)$$

the lot may be accepted.

Third, if neither (98) nor (99) is satisfied, the lot is accepted if

$$X > N\,\frac{M + U}{2} \quad \text{and} \quad Y < aX^2 - b_U X + c_U \qquad (100)$$

or

$$X < N\,\frac{M + U}{2} \quad \text{and} \quad Y < aX^2 - b_M X + c_M \qquad (101)$$

where

$$a = \frac{Nk^2 + N - 1}{N^2 k^2} \qquad (102)$$

and $b_U$, $c_U$, $b_M$, and $c_M$ are the corresponding coefficients
defined in formulas (103) and (104).

If neither (100) nor (101) is met, the lot is rejected.
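
The quadratic form of the criterion can be illustrated by a sketch in which the coefficients are derived afresh rather than copied from the book's formulas. Assuming X is the sum and Y the sum of squares of the N observations, and s is computed with divisor N - 1, squaring ks < U - X/N gives Y < aX^2 - bX + c with a as in (102), b = 2L(N - 1)/(Nk^2), and c = L^2(N - 1)/k^2, where L stands for the limit (U or M) being tested. These expressions for b and c are this sketch's own derivation and are not asserted to be the book's exact notation. The sketch also treats equality at the midpoint and at the limits explicitly, which is a choice made here.

```python
from statistics import mean, stdev


def coefficients(N, k, limit):
    """Coefficients (a, b, c) of the parabola Y = a*X^2 - b*X + c that
    bounds the acceptable sum of squares for a given limit (derived here)."""
    a = (N * k**2 + N - 1) / (N**2 * k**2)
    b = 2 * limit * (N - 1) / (N * k**2)
    c = limit**2 * (N - 1) / k**2
    return a, b, c


def accept_by_sums(sample, k, M, U):
    """Decide acceptance from X and Y alone, without computing s."""
    N = len(sample)
    X = sum(sample)
    Y = sum(x * x for x in sample)
    if X >= N * U or X <= N * M:          # mean outside the limits: reject
        return False
    limit = U if X >= N * (M + U) / 2 else M
    a, b, c = coefficients(N, k, limit)
    return Y < a * X**2 - b * X + c


def accept_directly(sample, k, M, U):
    """Reference decision using the mean and standard deviation."""
    xbar, s = mean(sample), stdev(sample)
    return xbar + k * s < U and xbar - k * s > M


sample = [28, 30, 31, 29, 32, 30, 27, 33]   # hypothetical measurements
print(accept_by_sums(sample, 0.7645, 25, 35), accept_directly(sample, 0.7645, 25, 35))
```

Working from X and Y has the practical advantage that no square root is needed at the inspection bench, which is presumably why the criterion is cast in this form.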

For adjusted observations, as defined by (71), the lot may be accepted
without regard to X if

$$Y' < \frac{(U - A)^2 (N - 1)}{k^2 + 1} \quad \text{when} \quad A \ge \frac{M + U}{2} \qquad (105)$$

or

$$Y' < \frac{(A - M)^2 (N - 1)}{k^2 + 1} \quad \text{when} \quad A \le \frac{M + U}{2} \qquad (106)$$

In place of (100) and (101) we have

$$X > N\,\frac{M + U}{2} \quad \text{and} \quad Y' < aX^2 - (b_U + 2A)X + (c_U + NA^2) \qquad (107)$$

$$X < N\,\frac{M + U}{2} \quad \text{and} \quad Y' < aX^2 - (b_M + 2A)X + (c_M + NA^2) \qquad (108)$$

The most efficient form of (105) and (106), either for graphical purposes
or for permitting immediate acceptance on the basis of Y' alone, is
obtained by setting

$$A = \frac{M + U}{2} \qquad (109)$$

In this case inequalities (105) and (106) are identical.

For either adjusted or unadjusted observations graphs may be constructed
as described in Sec. 7.3. The maximum value of Y for acceptance
for each value of X will be obtained from formula (100) or (107)
for values of X greater than N(M + U)/2 and from formula (101) or (108)
for values of X less than N(M + U)/2. In the case of adjusted
observations, if A = (M + U)/2 the figures will be symmetrical about the
line X = NA. The upper boundaries of the error region may be found by
formula (70) or (74) of Sec. 7.3.

10.2.3. Two-sided Criterion. A two-sided criterion is most easily
applied if expressed in terms of a chart similar to Fig. 1.8, in which
values of $\bar{x}$ are represented horizontally and values of s
vertically. For each sample, a point is plotted corresponding to the
observed $\bar{x}$ and s. If this point lies within the region bounded by
the acceptance curve and the horizontal axis, the lot is accepted;
otherwise, it is rejected.

The first step in finding the boundary of the acceptance region is to
compute k and N from formulas (10) and (11). The values of $p_1$ and $p_2$
now represent fractions outside the interval M to U, instead of fractions
above U.
The second step is to find the indifference quality $p_0$, which is simply
the probability that a normal deviate will exceed k. It may be determined
from Table 1.5.

The third step is to divide $p_0$ arbitrarily into two parts, $p_0'$ and
$p_0''$ ($p_0' < p_0''$) such that

$$p_0' + p_0'' = p_0 \qquad (110)$$

Each pair $p_0'$, $p_0''$ leads to two points on the acceptance curve, so
enough different pairs should be used to permit plotting the curve.





Fig. 1.8. Acceptance region for a two-sided test defined by $p_1 = 0.15$,
$p_2 = 0.30$, $\alpha = 0.01$, $\beta = 0.02$, U = 35, and M = 25; also
corresponding region for a pair of one-sided tests. [Data from Table 1.7,
columns (9), (10), and (11).]


The fourth step is to find $K_{p_0'}$ and $K_{p_0''}$, normal deviates
exceeded with probabilities $p_0'$ and $p_0''$, respectively. These may be
obtained from Table 1.1.

The fifth step is to substitute each pair of values $K_{p_0'}$,
$K_{p_0''}$ into the pair of equations

$$\bar{x} = \frac{U K_{p_0'} + M K_{p_0''}}{K_{p_0'} + K_{p_0''}} \qquad s = \frac{U - M}{K_{p_0'} + K_{p_0''}} \qquad (111)$$

or into the equivalent pair

$$\bar{x} = \frac{U + M}{2} + \frac{U - M}{2} \cdot \frac{K_{p_0'} - K_{p_0''}}{K_{p_0'} + K_{p_0''}} \qquad s = \frac{U - M}{K_{p_0'} + K_{p_0''}} \qquad (112)$$

Each pair of values $K_{p_0'}$, $K_{p_0''}$ when substituted into (111) or
(112) leads to a combination $\bar{x}, s$ lying on the boundary of the
acceptance region; another point on the boundary is given by the
combination $\bar{x}', s$ where

$$\bar{x}' = U + M - \bar{x} \qquad (113)$$

Table 1.7 shows the computations for the two-sided acceptance region
defined by $p_1 = 0.15$, $p_2 = 0.30$, $\alpha = 0.01$, and $\beta = 0.02$,
where defects are represented by values below 25 or above 35. The values
N = 95 and k = 0.7645 are taken from Sec. 4.3. The resulting region is
plotted in Fig. 1.8. The figure also shows the acceptance region given by
using the pair of one-sided criteria

$$\bar{x} - 0.7645s > 25 \qquad \bar{x} + 0.7645s < 35 \qquad (114)$$

It will be observed that the acceptance region for the two-sided test lies
entirely within that for the pair of one-sided tests. As $\bar{x}$
approaches U or M, the two boundaries nearly coincide. The difference
between the regions is fairly sizable in this case, which involves rather
large fractions defective, but for smaller fractions defective the two
regions more nearly coincide. For example, keeping M = 25 and U = 35, but
taking $p_1 = 0.001$, $p_2 = 0.0015$, $\alpha = 0.05$, $\beta = 0.10$ (for
which Table 1.2 shows k = 3.0214) we find that for $\bar{x} = 30$ the
maximum acceptable value of s is given by the pair of one-sided tests as
1.655 and by the two-sided test as 1.550; and the difference is less for
any other value of $\bar{x}$. The effect of the difference between a pair
of one-sided tests and a two-sided test should be judged by the frequency
of lots leading to sample points that would signify acceptance by one test
and rejection by the other. One way in which the two-sided test presented
here is useful is in judging whether two one-sided tests can be used
satisfactorily.
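
The five steps for the two-sided boundary can be sketched as follows. The sketch assumes, as in Sec. 11.1.4, that the part of the indifference quality assigned to the lower limit is the fraction below M and the remainder is the fraction above U; the function names and the `share` argument (the proportion of the indifference quality placed below M) are illustrative, and the standard library's normal distribution replaces Tables 1.1 and 1.5.

```python
from statistics import NormalDist

STD = NormalDist()


def deviate(p):
    """Normal deviate exceeded with probability p."""
    return STD.inv_cdf(1.0 - p)


def boundary_point(k, M, U, share):
    """One point (x-bar, s) on the two-sided acceptance boundary.
    `share` (between 0 and 1) is the part of p0 placed below M."""
    p0 = 1.0 - STD.cdf(k)                      # step 2: indifference quality
    K_low = deviate(share * p0)                # steps 3 and 4
    K_high = deviate((1.0 - share) * p0)
    s = (U - M) / (K_low + K_high)             # step 5
    xbar = (U * K_low + M * K_high) / (K_low + K_high)
    return xbar, s


def one_sided_max_s(k, M, U, xbar):
    """Largest s accepted at x-bar by the pair of one-sided criteria."""
    return min(U - xbar, xbar - M) / k


M, U = 25, 35
for k in (0.7645, 3.0214):
    xbar, s = boundary_point(k, M, U, share=0.5)   # midpoint of M and U
    print(k, round(xbar, 3), round(s, 3), round(one_sided_max_s(k, M, U, xbar), 3))
```

With k = 3.0214 the midpoint values come out close to the 1.550 and 1.655 quoted in the text. The mirror-image point for each boundary point follows by replacing the mean with U + M minus the mean.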

If a two-sided test is used, criteria for reducing or tightening
inspection may also be expressed graphically. Upper and lower limits of
$k_0$ should be computed as described in the last paragraph of Sec. 8.2.2
and used as values of k from which to compute two boundary curves
according to formulas (111) or (112). If the value of $k_0$ found in computing the
PA lies between the two curves, normal inspection is continued; if it falls 
below the lower curve, tightened inspection is installed; and if it falls 
above the upper curve reduced inspection is installed. 


11. TECHNICAL NOTES 

11.1. Basic Formulas 

11.1.1. Assumptions. Sampling plans for controlling percent defective
on the basis of variables involve drawing a sample of N at random
from a lot and computing

$$\bar{x} + ks \qquad (115)$$

where

$$\bar{x} = \frac{\sum x}{N} \qquad (116)$$

$$s = \sqrt{\frac{N\sum x^2 - (\sum x)^2}{N(N - 1)}} \qquad (117)$$

x represents measurements of the variable being inspected, and k is a
constant.

If x is normally distributed, $\bar{x}$ will be normally distributed with
mean $\mu$ and standard deviation $\sigma/\sqrt{N}$, where $\mu$ and
$\sigma$ are the lot mean and standard deviation of x. Also, unless N is
extremely small, say, less than 5, s will also be nearly normally
distributed, its mean and standard deviation being $a\sigma$ and
$b\sigma/\sqrt{2(N - 1)}$, respectively, where

$$a = \sqrt{\frac{2}{N - 1}}\;\frac{\Gamma(N/2)}{\Gamma[(N - 1)/2]} \qquad (118)$$

$$b = \sqrt{2(N - 1)(1 - a^2)} \qquad (119)$$

The factors a and b are slightly less than 1 (when N = 5, for example,
a = 0.940 and b = 0.965) and they increase toward 1 as N increases.*
They are therefore treated as 1 throughout this chapter; that is, the
standard deviation is regarded as normally distributed with mean $\sigma$
and standard deviation $\sigma/\sqrt{2(N - 1)}$.
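
The size of the factors a and b can be examined numerically. The sketch below uses the standard normal-theory expressions for the mean and standard deviation of s (with divisor N - 1); that these are the expressions intended here is an assumption, supported by their reproducing the values quoted for N = 5. The function names are illustrative, and `math.gamma` limits the sketch to moderate N (it overflows for N above roughly 340).

```python
from math import gamma, sqrt


def a_factor(N: int) -> float:
    """Ratio of the mean of s to sigma for samples of N from a normal lot."""
    return sqrt(2.0 / (N - 1)) * gamma(N / 2) / gamma((N - 1) / 2)


def b_factor(N: int) -> float:
    """Ratio of the standard deviation of s to sigma / sqrt(2(N - 1))."""
    return sqrt(2.0 * (N - 1) * (1.0 - a_factor(N) ** 2))


for N in (5, 10, 25, 50, 100):
    print(N, round(a_factor(N), 3), round(b_factor(N), 3))
```

The printed values rise steadily toward 1, which is the justification for treating both factors as 1 in samples of the sizes usually required by these plans.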

Since a linear combination of independently distributed variables
tends to be normally distributed even when the individual variables
depart from normality, $\bar{x} + ks$ will be more nearly normally
distributed than s; in fact, $\bar{x} + ks$ will tend toward normality
even if the distribution of x departs from normality. The mean of
$\bar{x} + ks$ is $\mu + k\sigma$ and the standard deviation is

$$\sigma\sqrt{\frac{1}{N} + \frac{k^2}{2(N - 1)}} \qquad (120)$$


* A table of a and b is given in Jennett and Welch, “Control of Proportion
Defective,” p. 85. This table gives a and b to 3 decimal places for values
of N from 5 to 15 by steps of 1, from 15 to 30 by steps of 5, and for 40,
50, 75, and 100. A table of $c_2 = a\sqrt{(N - 1)/N}$ to 5 decimal places
for values of N from 3 to 25 by steps of 1 and from 25 to 100 by steps of 5
is given by Shewhart, Economic Control of Quality, p. 185. Values of
$c_2 = a\sqrt{(N - 1)/N}$ are given to 4 decimal places for values of N
from 2 to 25 by steps of 1 in American Society for Testing Materials
Manual, p. 50 of Supplement B; the same table is given in American
Standards Association, Control Chart Method, p. 39.
11.1.2. Direct Derivation. The probability that in a sample
$\bar{x} + ks$ will be less than U then depends on the difference between
U and $\mu + k\sigma$, measured in units of the standard deviation of
$\bar{x} + ks$; that is, the probability that $\bar{x} + ks$ will be less
than U is the probability that a unit normal deviate will be less than

$$\frac{U - (\mu + k\sigma)}{\sigma\sqrt{\dfrac{1}{N} + \dfrac{k^2}{2(N - 1)}}} \qquad (121)$$

The probability will be $\epsilon$ that the test will result in acceptance
if

$$\frac{U - (\mu + k\sigma)}{\sigma\sqrt{\dfrac{1}{N} + \dfrac{k^2}{2(N - 1)}}} = -K_\epsilon \qquad (122)$$

where $K_\epsilon$ is defined by

$$\frac{1}{\sqrt{2\pi}} \int_{K_\epsilon}^{\infty} e^{-t^2/2}\, dt = \epsilon \qquad (123)$$

We may write (122)

$$\frac{U - \mu}{\sigma} = k - K_\epsilon \sqrt{\frac{1}{N} + \frac{k^2}{2(N - 1)}} \qquad (124)$$

$(U - \mu)/\sigma$ is a unit normal deviate that will be exceeded with some
probability p, where p is the proportion defective in the lot, so we may
write

$$K_p = \frac{U - \mu}{\sigma} \qquad (125)$$

Substituting this into (124) and rearranging, we find

$$k = K_p + K_\epsilon \sqrt{\frac{1}{N} + \frac{k^2}{2(N - 1)}} \qquad (126)$$

If both sides of (126) are decreased by $K_p$ and then squared, a
rearrangement of terms gives

$$\left[1 - \frac{K_\epsilon^2}{2(N - 1)}\right] k^2 - 2K_p k + \left(K_p^2 - \frac{K_\epsilon^2}{N}\right) = 0 \qquad (127)$$

Denoting the coefficient of $k^2$ by a and the term independent of k by b,
k may be expressed in terms of N, $K_p$, and $K_\epsilon$ as

$$k = \frac{K_p \pm \sqrt{K_p^2 - ab}}{a} \qquad (128)$$
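
A numerical sketch of this solution for k may be useful. It takes a as 1 minus the squared deviate for the acceptance probability divided by 2(N - 1), and b as the squared deviate for p minus the squared deviate for the acceptance probability divided by N, which is what squaring (126) yields. As a check it reproduces the multipliers 1.2632 and 1.3695 used for the tolerance limits of Sec. 9.2 when p = 0.15 and the acceptance probability is set to 0.05 and 0.01; the sample size N = 95 used for that check is an assumption, since it is not stated in that passage. Function names are illustrative.

```python
from math import sqrt
from statistics import NormalDist


def deviate(p):
    """Normal deviate exceeded with probability p."""
    return NormalDist().inv_cdf(1.0 - p)


def k_for(p, eps, N, sign=+1):
    """Multiplier k for which a lot of fraction defective p is accepted
    with probability eps in samples of N (quadratic solved for k)."""
    K_p, K_e = deviate(p), deviate(eps)
    a = 1.0 - K_e**2 / (2 * (N - 1))      # coefficient of k squared
    b = K_p**2 - K_e**2 / N               # term independent of k
    return (K_p + sign * sqrt(K_p**2 - a * b)) / a


xbar, s, N = 30.232, 5.000, 95            # N = 95 is assumed, see above
for gamma_conf in (0.95, 0.99):
    k = k_for(0.15, 1.0 - gamma_conf, N)
    print(gamma_conf, round(k, 4), round(xbar + k * s, 2))
```

The plus sign gives the root larger than the deviate for p, which is the one wanted when the acceptance probability is below one-half.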

If $K_p + k$ is subtracted from both sides of (126) and the equation
multiplied by -1, we have an equation expressing the fraction defective,
in terms of a normal deviate, as a function of N, k, and $\epsilon$; that
is, the abscissa of the OC curve is expressed as a function of the ordinate
of the OC curve and of the parameters defining the sampling plan.
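
Read in the other direction, the same relation gives the ordinate of the OC curve for any fraction defective: the probability of acceptance is the probability that a unit normal deviate falls below the deviate for p minus k, divided by the square root of 1/N + k^2/[2(N - 1)]. The sketch below assumes a normal lot and the approximations of Sec. 11.1.1; the function names are illustrative, and the plan N = 95, k = 0.7645 is the one used for Fig. 1.8.

```python
from math import sqrt
from statistics import NormalDist

STD = NormalDist()


def acceptance_probability(p, N, k):
    """Ordinate of the OC curve at fraction defective p for the plan
    'accept if x-bar + k*s < U' with sample size N."""
    K_p = STD.inv_cdf(1.0 - p)                      # deviate exceeded with prob. p
    spread = sqrt(1.0 / N + k**2 / (2 * (N - 1)))   # s.d. of x-bar + k*s over sigma
    return STD.cdf((K_p - k) / spread)


for p in (0.10, 0.15, 0.20, 0.25, 0.30, 0.35):
    print(p, round(acceptance_probability(p, 95, 0.7645), 4))
```

At the two design points, p = 0.15 and p = 0.30, the computed probabilities are close to the intended 0.99 and 0.02.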

Suppose we desire to choose N and k so that the probability will be
$1 - \alpha$ of accepting a lot of quality $p_1$ and $\beta$ of accepting a
lot of quality $p_2$, where $\alpha + \beta < 1$ and $p_2 > p_1$.
Substituting $p_1$ and $1 - \alpha$ into (126) gives one equation in N and
k, and substituting $p_2$ and $\beta$ gives another:

$$k = K_1 + K_{1-\alpha} \sqrt{\frac{1}{N} + \frac{k^2}{2(N - 1)}} \qquad (129)$$

$$k = K_2 + K_\beta \sqrt{\frac{1}{N} + \frac{k^2}{2(N - 1)}} \qquad (130)$$

where $K_1$ and $K_2$ have been written instead of $K_{p_1}$ and
$K_{p_2}$. After subtracting $K_1$ from (129) and $K_2$ from (130) and
dividing (129) by (130), we have

$$\frac{k - K_1}{k - K_2} = \frac{K_{1-\alpha}}{K_\beta} \qquad (131)$$

Solving this for k and replacing $K_{1-\alpha}$ by $-K_\alpha$ gives

$$k = \frac{K_\alpha K_2 + K_\beta K_1}{K_\alpha + K_\beta} \qquad (132)$$

N may then be determined by substituting k into (129) or (130) and
solving. The solution will be considerably simpler and there will be
little if any loss in accuracy if in the denominator of the second term
under the radical N - 1 is replaced by N. With this approximation (129)
becomes

$$k = K_1 + \frac{K_{1-\alpha}}{\sqrt{N}} \sqrt{1 + \frac{k^2}{2}} \qquad (133)$$

Subtracting $K_1$ from both sides, multiplying by $\sqrt{N}/(K_1 - k)$,
squaring, and replacing $K_{1-\alpha}$ by $-K_\alpha$ yields

$$N = \frac{k^2 + 2}{2} \left(\frac{K_\alpha}{K_1 - k}\right)^2 \qquad (134)$$

Similarly, (130) leads to

$$N = \frac{k^2 + 2}{2} \left(\frac{K_\beta}{k - K_2}\right)^2 \qquad (135)$$

By adding (134) and (135) and replacing the k of the denominators by
(132), N may be expressed as in (12); and by substituting for k in the
numerators as well as the denominators, N may be expressed directly in
terms of $K_1$, $K_2$, $K_\alpha$, and $K_\beta$ as in (11).
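
The choice of k and N from two points on the OC curve can be sketched in a few lines. The multiplier is the weighted average of the two deviates given above; for N the sketch uses the form obtained when that k is substituted in the denominators, namely (k^2 + 2)/2 times the square of the ratio of the sum of the risk deviates to the difference of the quality deviates, rounded up. That this form coincides with the book's formula (12) is an assumption; the function names are illustrative.

```python
from math import ceil
from statistics import NormalDist


def deviate(p):
    """Normal deviate exceeded with probability p."""
    return NormalDist().inv_cdf(1.0 - p)


def design_plan(p1, p2, alpha, beta):
    """Return (k, N) for a plan accepting quality p1 with probability
    1 - alpha and quality p2 with probability beta (N - 1 replaced by N)."""
    K1, K2 = deviate(p1), deviate(p2)
    Ka, Kb = deviate(alpha), deviate(beta)
    k = (Ka * K2 + Kb * K1) / (Ka + Kb)
    N = (k**2 + 2) / 2 * ((Ka + Kb) / (K1 - K2)) ** 2
    return k, ceil(N)


print(design_plan(0.15, 0.30, 0.01, 0.02))      # plan used for Fig. 1.8
print(design_plan(0.001, 0.0015, 0.05, 0.10))
```

The first call gives k close to 0.7645 with N = 95, and the second gives k close to 3.0214, in agreement with the values cited in Sec. 10.2.3.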

If instead of replacing N - 1 by N in (129) and (130) we replace N
by N - 1, the right-hand sides of (134) and (135) are not affected; but
the left-hand sides become N - 1. This approximation therefore leads
to a value of N one unit larger than that given by (134) and (135). Since
N must be an integer, the effects of rounding are of the same order of
magnitude as those of replacing N - 1 by N. It is because of this
approximation that values of N given by (134) and (135) that are not
integers should be rounded up instead of to the nearest integer. If the
approximation is not made, we find

$$N = \frac{b + \sqrt{b^2 - 16a}}{4a} \qquad (136)$$

where

$$a = \left(\frac{K_1 - K_2}{K_\alpha + K_\beta}\right)^2 \qquad (137)$$

$$b = k^2 + 2(a + 1) \qquad (138)$$
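
The exact solution can be compared with the approximate one in a short sketch. It assumes that a is the squared ratio of the difference of the quality deviates to the sum of the risk deviates and that b = k^2 + 2(a + 1), so that N is the larger root of 2aN^2 - bN + 2 = 0, the quadratic obtained by clearing fractions in 1/N + k^2/[2(N - 1)] = a. Function names are illustrative.

```python
from math import sqrt
from statistics import NormalDist


def deviate(p):
    """Normal deviate exceeded with probability p."""
    return NormalDist().inv_cdf(1.0 - p)


def sample_sizes(p1, p2, alpha, beta):
    """Return (approximate N, exact N), both unrounded."""
    K1, K2 = deviate(p1), deviate(p2)
    Ka, Kb = deviate(alpha), deviate(beta)
    k = (Ka * K2 + Kb * K1) / (Ka + Kb)
    a = ((K1 - K2) / (Ka + Kb)) ** 2
    b = k**2 + 2 * (a + 1)
    approximate = (k**2 + 2) / (2 * a)          # N - 1 replaced by N
    exact = (b + sqrt(b * b - 16 * a)) / (4 * a)
    return approximate, exact


approx_N, exact_N = sample_sizes(0.15, 0.30, 0.01, 0.02)
print(round(approx_N, 2), round(exact_N, 2))
```

For the plan of Fig. 1.8 the exact value exceeds the approximate one by less than one unit, and both round up to the same integer, which bears out the rounding rule stated above.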


11.1.3. Derivation from General Formulas for Sample Size and
Critical Level. As an alternative derivation of formulas (132) and
(134), we may start from the fact that $\bar{x} + ks$ is normally
distributed about a certain mean in good lots and about another mean in bad
lots and employ the ordinary procedures for testing the mean of a normally
distributed variable. Let z represent a normally distributed variable
whose mean is Z and whose standard deviation is a function of the sample
size of the form

$$\sigma(N) = \sigma \cdot F(N) \qquad (139)$$

where F(N) is any function of N that is independent of $\sigma$. Suppose a
test is required such that the probability is $\alpha$ of rejecting a lot
in which the mean is $Z_1$ and $\beta$ of accepting a lot in which the mean
is $Z_2$, where $Z_2 > Z_1$. The test is made by finding z in a sample of N
and accepting the lot if z < A or rejecting it if z > A, where A is a
constant. Then if a lot has a mean of $Z_1$, the probability $L_1$ of
acceptance is

$$L_1 = \frac{1}{\sigma(N)\sqrt{2\pi}} \int_{-\infty}^{A} e^{-\frac{(z - Z_1)^2}{2\sigma^2(N)}}\, dz = \frac{1}{\sqrt{2\pi}} \int_{-\infty}^{\frac{A - Z_1}{\sigma(N)}} e^{-t^2/2}\, dt \qquad (140)$$

where

$$t = \frac{z - Z_1}{\sigma(N)} \qquad (141)$$
If the lot mean is $Z_2$ the probability $L_2$ of acceptance is

$$L_2 = \frac{1}{\sigma(N)\sqrt{2\pi}} \int_{-\infty}^{A} e^{-\frac{(z - Z_2)^2}{2\sigma^2(N)}}\, dz = \frac{1}{\sqrt{2\pi}} \int_{-\infty}^{\frac{A - Z_2}{\sigma(N)}} e^{-t^2/2}\, dt \qquad (142)$$

where t is defined by (141) but in terms of $Z_2$ instead of $Z_1$. The
appropriate test is then defined by values of A and N such that

$$L_1 = 1 - \alpha \qquad (143)$$

$$L_2 = \beta \qquad (144)$$

that is,

$$\frac{A - Z_1}{\sigma(N)} = K_\alpha \qquad (145)$$

$$\frac{A - Z_2}{\sigma(N)} = -K_\beta \qquad (146)$$

On dividing (145) by (146) and solving for A, we find

$$A = \frac{K_\alpha Z_2 + K_\beta Z_1}{K_\alpha + K_\beta} \qquad (147)$$

On multiplying both (145) and (146) by $\sigma(N)$, subtracting (146) from
(145), replacing $\sigma(N)$ by $\sigma \cdot F(N)$, and solving for F(N),
we find

$$F(N) = \frac{1}{\sigma} \left(\frac{Z_2 - Z_1}{K_\alpha + K_\beta}\right) \qquad (148)$$

In the ordinary case of testing the mean of a normally distributed
variable, F(N) is simply $\sqrt{1/N}$, and this leads to N = 1/a, where a
is defined by (137).
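
For the ordinary case of testing a mean, the two results can be turned into a small calculation. The sketch assumes the standard deviation of the test statistic is the lot standard deviation divided by the square root of N, so that N is the reciprocal of the squared value required of F(N), rounded up. The numerical inputs below are hypothetical and serve only to exercise the formulas; the function names are illustrative.

```python
from math import ceil
from statistics import NormalDist


def deviate(p):
    """Normal deviate exceeded with probability p."""
    return NormalDist().inv_cdf(1.0 - p)


def mean_test_plan(Z1, Z2, sigma, alpha, beta):
    """Critical level A and sample size N for accepting when z < A, with
    risk alpha of rejecting at mean Z1 and beta of accepting at mean Z2."""
    Ka, Kb = deviate(alpha), deviate(beta)
    A = (Ka * Z2 + Kb * Z1) / (Ka + Kb)          # critical level
    F = (Z2 - Z1) / (sigma * (Ka + Kb))          # required value of F(N)
    N = ceil(1.0 / F**2)                         # ordinary case: F(N) = 1/sqrt(N)
    return A, N


# Hypothetical values: good mean 0, bad mean 1, sigma 2, risks 0.05 and 0.10.
A, N = mean_test_plan(0.0, 1.0, 2.0, 0.05, 0.10)
print(round(A, 4), N)
```

The critical level lies nearer the bad mean than the good one whenever the risk of rejecting good lots is the smaller of the two risks, as it is in this example.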

in applying formulas (147) and (148) to the case of percent defective, 
we have 


z = x + ks 

(149) 

7j = /jl + kc 

(150) 

- Jn + 

(151) 

A lot is considered good if 


21 1 

IV 

(152) 

and bad if 


U ~><Ki 

<T 

(153) 


where K i > K 2 . From (152), using the equality sign, we find 

— K x <7 (154) 
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Substituting this into (150) gives 


Similarly, 
Substituting from 


Zx = U + (* — K{)o 
Z 2 = U+(k- Ki)<r 

(151), (155), and (156) into (148), we have 


(155) 

(156) 


4 


J_ k 2 

AT I O/ 


N ' 2(N - 1) 


K i - K 2 
K a + K„ 


(157) 


for which (136) is the solution. Substituting from (155) and (156) into
(147) gives

$$A = \frac{K_\alpha[U + (k - K_2)\sigma] + K_\beta[U + (k - K_1)\sigma]}{K_\alpha + K_\beta} = \frac{K_\alpha(U + k\sigma) + K_\beta(U + k\sigma)}{K_\alpha + K_\beta} - \frac{K_\alpha K_2 + K_\beta K_1}{K_\alpha + K_\beta}\,\sigma \tag{158}$$

$$A = U + k\sigma - \frac{K_\alpha K_2 + K_\beta K_1}{K_\alpha + K_\beta}\,\sigma \tag{159}$$

Any combination of values of $k$ and $A$ that satisfy this equation will be
satisfactory values for the multiple of the standard deviation and the
rejection level of $\bar{x} + ks$, but most pairs will be unusable because
calculation of the rejection level $A$ from (159) will require knowledge of $\sigma$.
If, however, we set

$$k = \frac{K_\alpha K_2 + K_\beta K_1}{K_\alpha + K_\beta} \tag{160}$$

this problem is solved by the disappearance of $\sigma$, and the rejection level
becomes simply

$$A = U \tag{161}$$

The value of $k$ from (160) may then be substituted into (157) to determine
$N$. If we replace $N - 1$ by $N$, the value of $N$ given by (157) is the
same as that given by formulas (134) and (135), otherwise it is the same
as that given by (136).
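
As a rough numerical illustration of (160) and (157), the following Python sketch computes $k$ and an unrounded sample size. It assumes that $N - 1$ in (157) is replaced by $N$, so that $N = (1 + k^2/2)\,[(K_\alpha + K_\beta)/(K_1 - K_2)]^2$ follows by simple algebra; the function names and the rounding up to the next integer are illustrative choices, not part of the original text.

```python
from math import ceil
from statistics import NormalDist


def normal_deviate(p):
    """Standard normal deviate K_p that is exceeded with probability p."""
    return NormalDist().inv_cdf(1.0 - p)


def approximate_plan(p1, p2, alpha, beta):
    """Return (k, n_real): k from (160); n_real from (157) with N - 1 -> N."""
    K1, K2 = normal_deviate(p1), normal_deviate(p2)
    Ka, Kb = normal_deviate(alpha), normal_deviate(beta)
    k = (Ka * K2 + Kb * K1) / (Ka + Kb)  # equation (160)
    # (157) with N - 1 replaced by N (an assumption of this sketch) reads
    #   (1 + k^2 / 2) / N = ((K1 - K2) / (Ka + Kb))^2
    n_real = (1.0 + k * k / 2.0) * ((Ka + Kb) / (K1 - K2)) ** 2
    return k, n_real


# Inputs taken from the first row of Table 1.8: p1 = 0.05, p2 = 0.5428.
k, n_real = approximate_plan(p1=0.05, p2=0.5428, alpha=0.01, beta=0.10)
print(f"k = {k:.4f}, N = {ceil(n_real)} (unrounded {n_real:.2f})")
```

With these inputs the sketch should land close to the approximate values $N = 5$ and $k = 0.5150$ listed in the first row of Table 1.8.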

11.1.4. Two-sided Criterion.—The basis for the two-sided test presented
in Sec. 10.2.3 is the fact that, because $\bar{x} + ks$ can be assumed
symmetrically distributed about $\mu + k\sigma$, the boundary between acceptable
and unacceptable combinations of $\bar{x}$ and $s$ corresponds with the
boundary between combinations of $\mu$ and $\sigma$ for which a lot has more or
less than a 50 percent chance of acceptance. Thus, in Fig. 1.8 if the $\bar{x}$
and $s$ axes are interpreted as $\mu$ and $\sigma$ axes, any lot represented by a point
on the boundary line has an equal chance of acceptance or rejection,
that is, corresponds with the indifference percent defective. For the
two-sided test it is clear that the indifference quality is the same as for a
one-sided test; for when $\sigma$ is very small and $\mu$ is near $U$ or $M$, the
two-sided and one-sided tests become virtually equivalent. Thus, the
procedure for finding the boundary of the two-sided acceptance region begins
by finding the proper indifference quality $p_0$. For any value of $\mu$, a
value of $\sigma$ is found such that the fraction $p'_0$ below $M$ plus the fraction $p''_0$
above $U$ is equal to $p_0$; these values of $\mu$ and $\sigma$ are then taken as the
coordinates of a point $\bar{x}, s$ on the acceptance boundary. For computational
purposes, starting with $\mu$ and finding the proper $\sigma$ is cumbersome,
but the parametric equations (111) and (112), which start with an
arbitrary division of $p_0$ into two parts $p'_0$ and $p''_0$, are easily used. Given
$p'_0$ and $p''_0$, we have

$$\mu - K_{p'_0}\sigma = M \tag{162}$$

and

$$\mu + K_{p''_0}\sigma = U \tag{163}$$

Equations (111) and (112) represent solutions of (162) and (163) for $\mu$
and $\sigma$ in terms of $U$, $M$, $K_{p'_0}$, and $K_{p''_0}$, and thus give the coordinates of a
point $\bar{x}, s$ on the acceptance boundary.
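
The pair (162) and (163) is linear in $\mu$ and $\sigma$, so a boundary point can be sketched directly. The Python fragment below solves the two equations by subtraction; it is an algebraic illustration only, since equations (111) and (112) themselves are not reproduced in this section, and the limits, the value of $p_0$, and the function name are hypothetical.

```python
from statistics import NormalDist


def boundary_point(U, M, p_below, p_above):
    """Solve (162) and (163) for (mu, sigma) given the split of p0."""
    K_below = NormalDist().inv_cdf(1.0 - p_below)  # deviate for fraction below M
    K_above = NormalDist().inv_cdf(1.0 - p_above)  # deviate for fraction above U
    sigma = (U - M) / (K_below + K_above)  # (163) minus (162)
    mu = M + K_below * sigma               # back-substitution into (162)
    return mu, sigma


# Trace a few boundary points for hypothetical values p0 = 0.05, M = 9, U = 11.
p0, M, U = 0.05, 9.0, 11.0
for share in (0.1, 0.3, 0.5, 0.7, 0.9):
    mu, sigma = boundary_point(U, M, share * p0, (1.0 - share) * p0)
    print(f"p'0 = {share * p0:.3f}  mu = {mu:.4f}  sigma = {sigma:.4f}")
```

Varying the share of $p_0$ assigned to the lower tail moves the point along the acceptance boundary.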

11.2. Approximations in the Basic Formulas 

Four incorrect assumptions are made in deriving the preceding
formulas:

(1) That $\bar{x} + ks$ is normally distributed; actually, it is not quite
normally distributed, since $s$ is not.

(2) That $s$ is distributed about a mean of $\sigma$; actually, it is distributed
about a mean of $a\sigma$, where $a < 1$ and is defined by equation (118).

(3) That $s$ is distributed with standard deviation $\sigma/\sqrt{2(N - 1)}$;
actually, it is distributed with standard deviation $b\sigma/\sqrt{2(N - 1)}$, where
$b < 1$ and is defined by equation (119).

(4) That $s$ is distributed with standard deviation $\sigma/\sqrt{2N}$. This
assumption applies only to formulas (134) and (135) for $N$.

All these assumptions can be avoided by utilizing the non-central
t distribution, tables of which have been published by Johnson and
Welch. In fact, the values of $N$ and $k$ and the corresponding true values
of $p_1$ and $p_2$ given in Table 1.2 were computed with the aid of tables of
the non-central t distribution. The effect of the approximations in the
formulas can therefore be examined by comparing the values they give
with those given in Table 1.2. Such a comparison exaggerates somewhat
the inaccuracy of the formulas because in preparing the table some effort
was expended, as is explained in Sec. 11.5, to make the best adjustment
to the discontinuous nature of $N$, rather than entering the first values of
$N$ and $k$ found. Similar adjustments could be made in using the formulas.
Section 4.7 includes a comparison between the results given by
the formulas and those in Table 1.2, the comparison being made for the
smallest sample size, namely, 6, included in Table 1.2. (It will be noted
that all four of the incorrect assumptions underlying the formulas rapidly
become very nearly correct as $N$ increases.)

TABLE 1.8

Comparison of Approximate and Exact Values of N and k

The exact values of $k$ in column (2) have been computed from the non-central
t distribution, taking $p_1 = 0.05$, $\alpha = 0.01$, and $N$ as shown in column (1). The exact
values of $p_2$ for which $\beta = 0.10$ have then been computed and entered in column (3).
From these $p_2$, taking $\beta = 0.10$, $p_1 = 0.05$, and $\alpha = 0.01$, approximate values of $N$ and $k$
have been computed from formulas (10) and (11) or (12). The true values of $p_1$
and $p_2$ for which $\alpha = 0.01$ and $\beta = 0.10$ have been computed from the non-central t
distribution.

           Exact                          Approximate
    N       k        p_2        N       k       true p_1   true p_2
   (1)     (2)       (3)       (4)     (5)        (6)        (7)

     5    0.5445    0.5428       5    0.5150    0.0541     0.5519
    10    0.8037    0.3774      10    0.7856    0.0524     0.3830
    15    0.9292    0.3033      15    0.9163    0.0517     0.3071
    20    1.0083    0.2604      20    0.9983    0.0513     0.2633
    25    1.0643    0.2332      25    1.0560    0.0510     0.2344
    30    1.1069    0.2119      30    1.1000    0.0509     0.2137
    35    1.1409    0.1966      35    1.1348    0.0507     0.1981
    40    1.1688    0.1846      40    1.1633    0.0507     0.1859
    45    1.1922    0.1748      45    1.1874    0.0506     0.1759
    50    1.2125    0.1666      50    1.2082    0.0505     0.1676
    65    1.2592    0.1489      64    1.2556    0.0501     0.1500
    75    1.2828    0.1404      74    1.2797    0.0501     0.1413
   100    1.3264    0.1256      99    1.3241    0.0501     0.1262


Table 1.8 shows a few comparisons of the values of $k$ and $N$ given
by formulas (132) and (134) or (135) with the exact values. In these
cases $\alpha$ has been fixed at 0.01 and $p_1$ at 0.05. Various values of $N$ have
been selected and the resulting lot quality $p_2$ for which $\beta = 0.10$ has been
calculated from the non-central t distribution. Thus the exact values of
$N$ and $k$ result in an OC curve passing exactly through the points defined
by $p_1, \alpha$ and $p_2, \beta$. If $p_1, \alpha$ and $p_2, \beta$ are both selected arbitrarily, it is not
usually possible to find values of $N$ and $k$ such that the OC curve passes
exactly through both points. Thus the exact values of $N$ and $k$ given
in Table 1.8 are more exact than could ordinarily be achieved in practical
problems, while those given by the formulas are characteristic of the
accuracy realizable in practice. Columns (6) and (7) of the table show
the true values of $p_1$ and $p_2$ for which $\alpha = 0.01$ and $\beta = 0.10$ when the
approximate $N$ and $k$ are used. The differences between 0.05 and the
values in column (6) and between the values in columns (3) and (7)
are a measure of the effect of the approximations. An alternative measure
would be the error in the true $\alpha$ and $\beta$ at the intended $p_1$ and $p_2$; this
error would probably be smaller than that in $p_1$ and $p_2$ because the OC
curves are usually fairly flat in the neighborhood of $p_1$ and $p_2$. While it
cannot be claimed that Table 1.8 gives a representative coverage of all
kinds of plans, it can be claimed that, for those comparisons given, the
discrepancy between the approximate and exact results is maximized.

11.3. Computational Formulas 

The use of the acceptance criterion

$$\bar{x} + ks \le U \tag{164}$$

requires the calculation of the sample mean $\bar{x}$ and the standard deviation
$s$. The acceptance criterion can, however, be expressed in terms of
simpler functions of the observations, namely,

$$X = \sum x \quad \text{(sum of the observations)} \tag{165}$$

$$Y = \sum x^2 \quad \text{(sum of the squares of the observations)} \tag{166}$$

If $k$ is positive, as it almost invariably will be, the lot may be rejected
without regard to $Y$ if

$$X > NU \tag{167}$$

In this case, $\bar{x}$ exceeds $U$, so since $s$ cannot be negative, $\bar{x} + ks$ exceeds $U$.

The values of $\bar{x}$ and $s$ for a sample of $N$ observations can be calculated
from

$$\bar{x} = \frac{\sum x}{N} \tag{168}$$

and

$$s = \sqrt{\frac{N\sum x^2 - (\sum x)^2}{N(N - 1)}} \tag{169}$$

In terms of $X$ and $Y$, equations (168) and (169) can be written

$$\bar{x} = \frac{X}{N} \tag{170}$$

and

$$s = \sqrt{\frac{NY - X^2}{N(N - 1)}} \tag{171}$$

Replacing $\bar{x}$ and $s$ in (164) by their values from (170) and (171) gives

$$\frac{X}{N} + k\sqrt{\frac{NY - X^2}{N(N - 1)}} \le U \tag{172}$$

Subtracting $X/N$ from both sides of inequality (172) and squaring gives

$$k^2\,\frac{NY - X^2}{N(N - 1)} \le U^2 - \frac{2XU}{N} + \frac{X^2}{N^2} \tag{173}$$

Clearing fractions and rearranging terms gives

$$Y \le aX^2 - bX + c \tag{174}$$

where

$$a = \frac{Nk^2 + N - 1}{N^2k^2} \tag{175}$$

$$b = \frac{2U(N - 1)}{Nk^2} \tag{176}$$

$$c = \frac{U^2(N - 1)}{k^2} \tag{177}$$

If inequality (174) is fulfilled, the lot is accepted; if inequality (174) is
not fulfilled, the lot is rejected.

Using the equality sign in (174) gives the equation of a parabola

$$Y_R = aX^2 - bX + c \tag{178}$$

which shows, for each value of $X$, the maximum value of $Y$ permitting
acceptance, that is, the upper boundary of the acceptance region or the
lower boundary of the rejection region.

The minimum value of the curve $Y_R$ gives a value of $Y$ below which
the lot is acceptable without regard to $X$. Differentiating (178) with
respect to $X$ gives

$$\frac{dY_R}{dX} = 2aX - b \tag{179}$$

Setting $dY_R/dX$ equal to zero and solving for $X$ gives

$$\check{X} = \frac{b}{2a} \tag{180}$$

Since

$$\frac{d^2Y_R}{dX^2} = 2a > 0 \tag{181}$$

$\check{X}$ is the value of $X$ at which the minimum value $\check{Y}_R$ of $Y_R$ occurs.
Replacing $X$ in (178) by the value of $\check{X}$ given by (180) gives, after simplification,

$$\check{Y}_R = c - \frac{b^2}{4a} \tag{182}$$

Substituting the values of $a$, $b$, and $c$ given by (175), (176), and (177) into
(180) and (182), and simplifying, gives

$$\check{X} = \frac{UN(N - 1)}{Nk^2 + N - 1} \tag{183}$$

$$\check{Y}_R = \frac{U^2N(N - 1)}{Nk^2 + N - 1} \tag{184}$$

The coordinates of the minimum point, $X = \check{X}$ and $Y = \check{Y}_R$, can be
approximated by replacing by $N$ the $N - 1$ in the denominators of
(183) and (184)

$$\text{Approximate } \check{X} = \frac{U(N - 1)}{k^2 + 1} \tag{185}$$

$$\text{Approximate } \check{Y}_R = \frac{U^2(N - 1)}{k^2 + 1} \tag{186}$$

These approximate values are smaller than the exact values since they
were obtained by increasing the denominators; if, therefore,

$$Y \le \frac{U^2(N - 1)}{k^2 + 1} \tag{187}$$

$Y$ will certainly be less than the minimum value of $Y_R$, and the lot may be
accepted without regard to $X$.

The acceptance region has a lower boundary corresponding, for each
value of $X$, with the minimum value of $Y$ algebraically possible. The
sum of the squares of the deviations of the observations from their mean
can be written

$$\sum (x - \bar{x})^2 = \sum x^2 - \frac{(\sum x)^2}{N} \tag{188}$$

or, in terms of $X$ and $Y$

$$\sum (x - \bar{x})^2 = Y - \frac{X^2}{N} \tag{189}$$

Since $\sum (x - \bar{x})^2$ cannot be negative,

$$Y \ge \frac{X^2}{N} \tag{190}$$

For any given sum of observations $X$, the value of $Y$ will be $X^2/N$ when the
sum of the squares of the deviations of the observations from their mean
is zero; that is, when all the observations are equal. The lower boundary
of the acceptance region, or upper boundary of the error region, is,
therefore,

$$Y_E = \frac{X^2}{N} \tag{191}$$

A sample point lying below this line would indicate a computational
error.

By equating $Y_R$ and $Y_E$ given by (178) and (191), it can be shown
that the lower and upper boundaries of the acceptance region meet at
the point $X = NU$ and $Y = NU^2$. The two curves are tangent at this
point. Beyond it, they again enclose an area similar to the acceptance
region; this area beyond the point of tangency would be the acceptance
region for a test in which $k$ is negative.

The graphical procedure of Sec. 7.3.1 consists of drawing on a chart
the upper and lower boundaries of the acceptance region, given by (178)
and (191); a point representing the sample is plotted on this chart and the
lot accepted or rejected according as the point lies within or above the
acceptance region. This graphical procedure is impractical, however,
when the observations are large. In this case, the range of variation in $Y$
as $X$ varies is large relative to the width of the acceptance region; and the
upper and lower boundaries of the acceptance region are too close for
most practical purposes.
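
The computational criterion of this section can be sketched in a few lines of Python. The sketch below applies (167), then the immediate-acceptance test (187), then the full parabola test (174) with the coefficients (175) to (177); the function names, the sample values, and the choice $k = 1.5$ are illustrative assumptions, and a direct evaluation of $\bar{x} + ks \le U$ is included only as a cross-check.

```python
from math import sqrt


def plan_coefficients(N, k, U):
    """Coefficients a, b, c of the parabola (174), from (175)-(177)."""
    a = (N * k**2 + N - 1) / (N**2 * k**2)
    b = 2 * U * (N - 1) / (N * k**2)
    c = U**2 * (N - 1) / k**2
    return a, b, c


def decide(observations, k, U):
    """Accept or reject using only X (sum) and Y (sum of squares); k > 0."""
    N = len(observations)
    X = sum(observations)
    Y = sum(x * x for x in observations)
    if X > N * U:                          # (167): reject without regard to Y
        return "reject"
    if Y <= U**2 * (N - 1) / (k**2 + 1):   # (187): accept without regard to X
        return "accept"
    a, b, c = plan_coefficients(N, k, U)
    return "accept" if Y <= a * X**2 - b * X + c else "reject"  # (174)


def decide_direct(observations, k, U):
    """Cross-check: evaluate the criterion (164) from the mean and s."""
    N = len(observations)
    mean = sum(observations) / N
    s = sqrt(sum((x - mean) ** 2 for x in observations) / (N - 1))
    return "accept" if mean + k * s <= U else "reject"


sample = [9.8, 10.1, 10.0, 9.9, 10.2]   # hypothetical observations
for U in (10.5, 10.2):
    print(U, decide(sample, k=1.5, U=U), decide_direct(sample, k=1.5, U=U))
```

In this example the values of $Y$ and of the parabola at $X = 50$ differ only in the first decimal place while both are near 500, which illustrates why the unadjusted graphical procedure becomes impractical when the observations are large.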

An adjustment of the observations can be made (a) to overcome this
difficulty in the graphical procedure or (b) to find a criterion more effective
than formula (187) in permitting immediate acceptance. The adjustment
consists of subtracting an adjustment constant $A$ from each of the
observations, $A$ being chosen according to the purpose (a) or (b) for
which the adjustment is being used. It is not necessary to adjust each
observation; it is necessary only to compute the sum of the squares of
the adjusted observations $Y'$ from $X$ and $Y$ by

$$Y' = \sum (x - A)^2 = Y - 2AX + NA^2 \tag{192}$$

Adjusting $Y_R$ and $Y_E$, given in (178) and (191), by (192) (that is,
subtracting $2AX$ and adding $NA^2$) gives the upper and lower boundaries of
the acceptance region

$$Y'_R = aX^2 - (b + 2A)X + (c + NA^2) \tag{193}$$

$$Y'_E = \frac{X^2}{N} - 2AX + NA^2 \tag{194}$$

where $a$, $b$, and $c$ are defined by (175), (176), and (177).

It can be shown that the minimum value $\check{Y}'_R$ of $Y'_R$ is

$$\check{Y}'_R = \frac{(U - A)^2N(N - 1)}{Nk^2 + N - 1} \tag{195}$$

Replacing by $N$ the $N - 1$ in the denominator of $\check{Y}'_R$ gives the simplified
criterion for immediate acceptance

$$Y' \le \frac{(U - A)^2(N - 1)}{k^2 + 1} \tag{196}$$

If the graphical procedure is used, the adjustment constant $A$ should
be chosen so as to minimize the variation in $Y$ as $X$ varies over the region
of likely sample values. This flattens the curves defining the acceptance
region, thus enabling the acceptance region to encompass a larger
portion of the chart's area and thereby improving the accuracy of
plotting. To determine the optimum value of $A$, consider the slope of
$Y'_E$ at a given value of $X$

$$\frac{dY'_E}{dX} = \frac{2X}{N} - 2A \tag{197}$$

For a given value of $X$, the slope is zero when $A = X/N$. The optimum
value of $A$ depends, therefore, on the observed value of $X$. If before
the sample is inspected $\mu'$ is the best estimate of the mean of the
characteristic being inspected, the optimum value of $A$, for graphical purposes, is

$$A = \mu' \tag{198}$$

since the expected value of $X$ is $N\mu'$.

If the graphical procedure is not used, the adjustment constant $A$
should be chosen so as to maximize the effectiveness of formula (196) in
permitting immediate acceptance. This involves finding the value of $A$
that maximizes the $Y$ corresponding to the minimum value $Y' = \check{Y}'_R$.
Differentiating (193) with respect to $X$ gives

$$\frac{dY'_R}{dX} = 2aX - (b + 2A) \tag{199}$$

Since

$$\frac{d^2Y'_R}{dX^2} = 2a > 0 \tag{200}$$

setting $dY'_R/dX$ equal to zero and solving for $X$ gives the value of $X$ at which
$Y'_R$ is a minimum, namely,

$$\check{X} = \frac{b + 2A}{2a} \tag{201}$$

The value of $Y'_R$ calculated from (193) for the point whose abscissa is
given by (201) is the minimum value of $Y'_R$

$$\check{Y}'_R = -\frac{(b + 2A)^2}{4a} + (c + NA^2) \tag{202}$$

The value of $\check{Y}'_R$ given by (195) can be obtained from (202) by replacing
$a$, $b$, and $c$ by their values given by (175), (176), and (177). Finding the
value of $Y$ corresponding to $Y' = \check{Y}'_R$, that is, adding $2AX$ to, and
subtracting $NA^2$ from, (202) gives

$$Y = -\frac{(b + 2A)^2}{4a} + (c + NA^2) + 2AX - NA^2 = -\frac{(b + 2A)^2}{4a} + c + 2AX \tag{203}$$

Now, the optimum value of $A$ is that value which makes the $Y$ corresponding
to the minimum value $\check{Y}'_R$ as large as possible, since, for a given
value of $X$, if $Y$ is less than the value given by (203), the lot is accepted.
Differentiating (203) with respect to $A$ gives

$$\frac{dY}{dA} = -\frac{b + 2A}{a} + 2X \tag{204}$$

Since

$$\frac{d^2Y}{dA^2} = -\frac{2}{a} < 0 \tag{205}$$

setting $dY/dA$ equal to zero and solving for $A$ gives the value of $A$ at which
$Y$ is a maximum, namely,

$$A = aX - \frac{b}{2} \tag{206}$$

The value of $A$ given in (206) is a function of $X$, so the optimum value of
$A$ depends on the observed value of $X$. If before the sample is inspected
$\mu'$ is the best estimate of $\mu$, the anticipated value of $X$ is $N\mu'$. Replacing $X$
in (206) by its anticipated value gives

$$A = aN\mu' - \frac{b}{2} \tag{207}$$

Replacing $a$ and $b$ in (207) by their values given by (175) and (176), and
simplifying, gives

$$A = \frac{(Nk^2 + N - 1)\mu' - (N - 1)U}{Nk^2} \tag{208}$$

Replacing by $N$ the $N - 1$ in the numerator of (208) gives

$$\text{Approximate } A = \frac{(k^2 + 1)\mu' - U}{k^2} \tag{209}$$

Using the value of $A$ given by either (208) or (209) makes (196) very
nearly the most effective in permitting immediate acceptance. In terms
of $Y$, replacing the $A$ in (203) by (207), and simplifying, gives the linear
condition for acceptance

$$Y \le c' - b'X \tag{210}$$

where

$$b' = b - 2N\mu'a \tag{211}$$

$$c' = c - N^2\mu'^2a \tag{212}$$

and $a$, $b$, and $c$ are defined by (175), (176), and (177).
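
A short Python sketch can make the role of the adjustment constant concrete. It evaluates $A$ from (208) and the coefficients $b'$ and $c'$ from (211) and (212), and prints both the linear bound and the parabola (178) at several values of $X$. The plan values ($N = 5$, $k = 1.5$, $U = 10.5$, $\mu' = 10$) and the function names are hypothetical; the observation that the line touches the parabola at $X = N\mu'$ follows from the algebra, since the difference of the two equals $a(X - N\mu')^2$.

```python
def plan_coefficients(N, k, U):
    """Coefficients a, b, c from (175)-(177)."""
    a = (N * k**2 + N - 1) / (N**2 * k**2)
    b = 2 * U * (N - 1) / (N * k**2)
    c = U**2 * (N - 1) / k**2
    return a, b, c


def optimum_adjustment(N, k, U, mu_prime):
    """Adjustment constant A from (208)."""
    return ((N * k**2 + N - 1) * mu_prime - (N - 1) * U) / (N * k**2)


def linear_condition(N, k, U, mu_prime):
    """Coefficients (b', c') of the linear acceptance condition (210)."""
    a, b, c = plan_coefficients(N, k, U)
    b_lin = b - 2 * N * mu_prime * a        # (211)
    c_lin = c - N**2 * mu_prime**2 * a      # (212)
    return b_lin, c_lin


N, k, U, mu_prime = 5, 1.5, 10.5, 10.0      # hypothetical plan and prior mean
a, b, c = plan_coefficients(N, k, U)
A = optimum_adjustment(N, k, U, mu_prime)
b_lin, c_lin = linear_condition(N, k, U, mu_prime)
print(f"A = {A:.4f}")
for X in (45.0, 50.0, 52.0):
    line = c_lin - b_lin * X                # bound from (210)
    parabola = a * X**2 - b * X + c         # bound from (174)
    print(f"X = {X}: linear bound {line:.3f}, parabola {parabola:.3f}")
```

The linear bound never exceeds the parabola, so a lot accepted by (210) is also accepted by (174); the two agree when $X$ equals its anticipated value.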

11.4. Use of Non-central t Distribution 

11.4.1. Introduction.—The acceptance criterion

$$\bar{x} + ks \le U \tag{213}$$

can be written

$$\frac{U - \bar{x}}{s} \ge k \tag{214}$$

The quantity $(U - \bar{x})/s$ is a function of the sample data and $k$ is a
constant; using the acceptance criterion (214) is thus equivalent to comparing
the value of the sample statistic $(U - \bar{x})/s$ with a minimum level
for acceptance $k$.

If many samples of $N$ observations are chosen at random from the
same population (or lot), the values of $(U - \bar{x})/s$ obtained from the
various samples will vary. The proper determination of $k$ requires a
knowledge of the sampling distribution of $(U - \bar{x})/s$.

The distribution of the quantity $t$, defined as

$$t = \sqrt{N}\left(\frac{U - \bar{x}}{s}\right) \tag{215}$$

is known as the non-central t distribution, of which the familiar Student's
t distribution is a special case corresponding with $E\bar{x} = U$. Tables of the
non-central t distribution with an explanation of their application to the
problems considered in this chapter have been given by Johnson and
Welch.

11.4.2. Notation Used in the Tables. —It will be necessary to use the 
notation of Johnson and Welch to explain the application of their tables 
to the problems discussed in this chapter. The following list indicates 
the changes in notation:* 



                                                    This chapter              Johnson and Welch

Value of variable that defines quality              $U$                       $L$
Sample statistic                                    $k_o = (U - \bar{x})/s$   $u = (L - \bar{x})/s$
Acceptable level of statistic                       $k$                       $u_0$
Normal deviate corresponding with proportion
  $p$ defective in lot                              $K_p$                     $U$
Normal deviate corresponding with probability
  $\epsilon$ of accepting a lot                     $K_\epsilon$              $K_\epsilon$


* A word of explanation is in order about our departure from the Johnson-Welch 
and Jennett-Welch notation, since their notation is satisfactory and established. 
Throughout all three of our volumes the letter L is consistently used to represent the 
ordinate of an OC curve; as a substitute U has the mnemonic advantage of association 
with “upper limit.” It seems unnecessary to use two symbols for the normal deviate, 
since the type of subscript (Greek letter, numeral, and so forth) can discriminate 
between normal deviates defining quality and those describing operating charac¬ 
teristics. For the sample statistic, k accords with the lot parameter K. Logic suggests 
a subscript for the acceptable level of the statistic, and Johnson and Welch use o; but 
because the acceptable level enters the analysis so frequently and other values so 
seldom, we use no subscript. Logic also suggests a subscript for a particular observed 
value of the statistic, and we use o.

In the notation of Johnson and Welch, the acceptance criterion (214)
becomes

$$u \ge u_0 \tag{216}$$

Let $z$ be a quantity distributed normally about zero with unit standard
deviation, and let $w$ be a quantity distributed independently of $z$ and
in the same way as an estimate of variance computed from a sample
from a normal population containing $f$ degrees of freedom. That is, $w$
is distributed as $\chi^2/f$ with $f$ degrees of freedom. If $t$ is defined by

$$t = \frac{z + \delta}{\sqrt{w}} \tag{217}$$

where $\delta$ is some constant, then $t$ has the non-central t distribution,
depending only on the eccentricity $\delta$ and the number of degrees of freedom
$f$.

To see that the statistic $u$ is closely related to the non-central t
distribution, consider the distribution of values of $u$ from random samples
of $N$ observations from a normal population with mean $\mu$ and standard
deviation $\sigma$. Suppose that exactly a proportion $p$ of the items in the
population exceed a given limit $L$ and that $U$ is the standardized normal
deviate exceeded with probability $p$, that is,

$$\frac{1}{\sqrt{2\pi}} \int_{U}^{\infty} e^{-\frac{\tau^2}{2}}\,d\tau = p \tag{218}$$

Let $\bar{x}$ and $s$ be the sample mean and standard deviation, respectively.
Then, in the notation of Johnson and Welch, $\sqrt{N}\,u$ can be written as
(217) with

$$z = \frac{-\sqrt{N}(\bar{x} - \mu)}{\sigma} \tag{219}$$

$$w = \frac{s^2}{\sigma^2} \tag{220}$$

$$f = N - 1 \tag{221}$$

$$\delta = \sqrt{N}\,U \tag{222}$$

Hence, $\sqrt{N}\,u$ is distributed as a non-central t with $f = N - 1$ and
$\delta = \sqrt{N}\,U$.

The distribution of non-central t requires a triple-entry table, since
the probability that $t$ exceeds a given level $t_0$ depends on $f$, $\delta$, and $t_0$.
This probability is denoted by $P(f,\delta,t_0)$. Johnson and Welch also use
$t(f,\delta,\epsilon)$ to denote that value of $t_0$ for which the probability $P(f,\delta,t_0)$
takes a specified value $\epsilon$. The value of $\delta$ that makes $P(f,\delta,t_0) = \epsilon$ is
denoted by $\delta(f,t_0,\epsilon)$.

11.4.3. Description of the Tables.—The problems for which the non-central
t tables are used in this chapter require the evaluation of either
$\delta(f,t_0,\epsilon)$ or $t(f,\delta,\epsilon)$. Table IV given by Johnson and Welch facilitates the
direct calculation of these for 17 probability levels corresponding with $\epsilon$
and $1 - \epsilon$ equal to 0.005, 0.01, 0.025, 0.05, 0.1, 0.2, 0.3, 0.4, and 0.5.
In Table IV, a quantity $\lambda = \lambda(f,t_0,\epsilon)$ is given such that

$$\delta(f,t_0,\epsilon) = t_0 - \lambda\sqrt{1 + \frac{t_0^2}{2f}} \tag{223}$$

and

$$t(f,\delta,\epsilon) = \frac{\delta + \lambda\sqrt{1 + \dfrac{\delta^2}{2f} - \dfrac{\lambda^2}{2f}}}{1 - \dfrac{\lambda^2}{2f}} \tag{224}$$

Equations (42) and (21) of this chapter are essentially approximations for
equations (223) and (224), respectively, with $\lambda$ in (223) and (224)
replaced by $K_\epsilon$, the normal deviate exceeded with probability $\epsilon$. Even
for small samples, it is found that $\lambda$ does not differ much from $K_\epsilon$.

Table IV consists of nine double-entry panels (one for each of the
values of $\epsilon$ listed above) showing $\lambda$ as a function of $f$ and $t_0$. Values of $\lambda$
are given corresponding to $f$ = 4, 5, 6, 7, 8, 9, 16, 36, 144, and $\infty$. In
order to simplify interpolation for intermediate values of $f$ greater than 9,
the values of $f$ = 9, 16, 36, 144, and $\infty$ are such that $12/\sqrt{f}$ = 4, 3, 2, 1,
and 0, respectively; that is, $\lambda$ is tabled at equal intervals of $12/\sqrt{f}$.
The whole range of $t_0$ from $-\infty$ to $\infty$ is covered as follows: For $t_0/\sqrt{2f}$
between $-\infty$ and $-0.75$ or between $+0.75$ and $+\infty$, $\lambda$ is tabled against
the quantity

$$y = \frac{1}{\sqrt{1 + \dfrac{t_0^2}{2f}}} \tag{225}$$

For $t_0/\sqrt{2f}$ between $-0.75$ and $+0.75$, $\lambda$ is tabled against

$$y' = \frac{t_0/\sqrt{2f}}{\sqrt{1 + \dfrac{t_0^2}{2f}}} \tag{226}$$

11.4.4. Calculation of $\delta(f,t_0,\epsilon)$.—For given $f$, $t_0$, and $\epsilon \le 0.5$, the steps
required for the calculation of $\delta(f,t_0,\epsilon)$ are given below. For $\epsilon > 0.5$,
the relation

$$\delta(f,t_0,\epsilon) = -\delta(f,-t_0,1 - \epsilon) \tag{227}$$

is used. That is, if $\delta(9,12,0.95)$ is required, calculate $\delta(9,-12,0.05)$
by the procedure outlined below and take the negative of this value as the
required value of $\delta(9,12,0.95)$.

Step 1. Calculate $t_0/\sqrt{2f}$.

Step 2. Find $y = \dfrac{1}{\sqrt{1 + t_0^2/2f}}$ or $y' = \dfrac{t_0/\sqrt{2f}}{\sqrt{1 + t_0^2/2f}}$ according as
        $\left|t_0/\sqrt{2f}\right|$ is greater than or less than 0.75.

Step 3. If $f$ is greater than 9, calculate $12/\sqrt{f}$.

Step 4. From the panel of Table IV corresponding with the given $\epsilon$,
        obtain $\lambda = \lambda(f,t_0,\epsilon)$ by interpolating with respect to the
        quantities obtained in steps 2 and 3.

Step 5. Calculate $\delta(f,t_0,\epsilon) = t_0 - \lambda\sqrt{1 + t_0^2/2f}$.
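
The five steps above can be sketched in Python as follows. Table IV itself is not reproduced here, so the lookup of $\lambda$ is left as a caller-supplied function (the hypothetical argument `lam_lookup`); when none is supplied the sketch falls back on $\lambda \approx K_\epsilon$, which the text describes only as an approximation. The numerical check uses $t_0 = -12.47223$, $f = 29$, and $\lambda = 2.321$ from the work sheets of Sec. 11.4.7.

```python
from math import sqrt
from statistics import NormalDist


def delta_from_t(f, t0, lam):
    """Equation (223): delta for given f, t0 and a known lambda."""
    return t0 - lam * sqrt(1.0 + t0 * t0 / (2.0 * f))


def delta(f, t0, eps, lam_lookup=None):
    """delta(f, t0, eps), using relation (227) when eps > 0.5.

    lam_lookup(f, t0, eps) stands in for interpolation in Table IV
    (a hypothetical hook). Without it, lambda is approximated by K_eps.
    """
    if eps > 0.5:
        return -delta(f, -t0, 1.0 - eps, lam_lookup)      # relation (227)
    if lam_lookup is None:
        lam = NormalDist().inv_cdf(1.0 - eps)             # lambda ~ K_eps
    else:
        lam = lam_lookup(f, t0, eps)
    return delta_from_t(f, t0, lam)


# Round trip against the work-sheet figures (f = 29, lambda = 2.321).
print(round(delta_from_t(29, -12.47223, 2.321), 4))
```

The printed value should be close to $-16.9259$, the non-centrality used in the work-sheet example.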

11.4.5. Calculation of $t(f,\delta,\epsilon)$.—The calculation of $t(f,\delta,\epsilon)$ is accomplished
by an iterative method, since $\lambda$ is given in Table IV as a function
of $t_0$. The successive steps in the iterative method for calculating $t(f,\delta,\epsilon)$
for given $f$, $\delta$, and $\epsilon \le 0.5$ are given below. For $\epsilon > 0.5$, the relation

$$t(f,\delta,\epsilon) = -t(f,-\delta,1 - \epsilon) \tag{228}$$

is used. That is, if $t(9,5,0.90)$ is required, calculate $t(9,-5,0.10)$ by
the procedure outlined below, and take the negative of this value as the
required $t(9,5,0.90)$.

Step 1. Determine $K_\epsilon$, the normal deviate exceeded with probability
        $\epsilon$, which can be found in Table 1.1.

Step 2. Calculate the first approximation $t_1$ from

        $$t_1 = \frac{\delta + K_\epsilon\sqrt{1 + \dfrac{\delta^2}{2f} - \dfrac{K_\epsilon^2}{2f}}}{1 - \dfrac{K_\epsilon^2}{2f}}$$

Step 3. Calculate $t_1/\sqrt{2f}$.

Step 4. Find $y = \dfrac{1}{\sqrt{1 + t_1^2/2f}}$ or $y' = \dfrac{t_1/\sqrt{2f}}{\sqrt{1 + t_1^2/2f}}$ according as
        $\left|t_1/\sqrt{2f}\right|$ is greater than or less than 0.75.

Step 5. If $f$ is greater than 9, calculate $12/\sqrt{f}$.

Step 6. From the panel of Table IV corresponding with the given
        $\epsilon$, obtain $\lambda_1 = \lambda(f,t_1,\epsilon)$ by interpolating with respect to the
        quantities obtained in steps 4 and 5.

Step 7. Calculate a second approximation $t_2$ from

        $$t_2 = \frac{\delta + \lambda_1\sqrt{1 + \dfrac{\delta^2}{2f} - \dfrac{\lambda_1^2}{2f}}}{1 - \dfrac{\lambda_1^2}{2f}}$$

Step 8. Repeat steps 3 through 7, replacing $t_1$ by $t_2$, and calculate a
        third approximation $t_3$. Repeat these steps until two successive
        approximations are the same; usually not more than
        two approximations are required. Take for $t(f,\delta,\epsilon)$ the value
        so obtained.
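
The iteration in steps 1 through 8 can be sketched in Python. Interpolation in Table IV is again represented by a caller-supplied function (a hypothetical hook); in the usage shown it simply returns 2.321, the value of $\lambda$ recorded in the work sheets of Sec. 11.4.7 for $f = 29$ and $\epsilon = 0.01$. The tolerance and the iteration cap are illustrative choices.

```python
from math import sqrt
from statistics import NormalDist


def t_from_lambda(f, delta, lam):
    """Equation (224): t for given f, delta and a trial value of lambda."""
    q = lam * lam / (2.0 * f)
    return (delta + lam * sqrt(1.0 + delta * delta / (2.0 * f) - q)) / (1.0 - q)


def t_value(f, delta, eps, lam_lookup, tol=5e-6, max_iter=20):
    """Iterative calculation of t(f, delta, eps), steps 1-8.

    lam_lookup(f, t, eps) stands in for interpolation in Table IV.
    """
    if eps > 0.5:                                   # relation (228)
        return -t_value(f, -delta, 1.0 - eps, lam_lookup, tol, max_iter)
    k_eps = NormalDist().inv_cdf(1.0 - eps)         # step 1
    t = t_from_lambda(f, delta, k_eps)              # step 2
    for _ in range(max_iter):
        lam = lam_lookup(f, t, eps)                 # steps 3-6
        t_next = t_from_lambda(f, delta, lam)       # step 7
        if abs(t_next - t) < tol:                   # step 8: stop when stable
            return t_next
        t = t_next
    return t


def sheet3_lookup(f, t, eps):
    """Stand-in for Table IV: the lambda recorded in work sheet 3."""
    return 2.321


t = t_value(29, 16.92590, 0.99, sheet3_lookup)
u0 = t / sqrt(30)
print(f"t = {t:.4f}, u0 = {u0:.4f}")
```

With these inputs the result should agree closely with the work-sheet values $t = 12.47223$ and $u_0 = 2.27711$.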

11.4.6. Illustration of the Use of the Non-central t Tables in Specific 
Problems 

Problem I. Given: $N$, $p_1$, $\epsilon$ (in our notation, $N$, $p_1$, $1 - \alpha$ or
$N$, $p_2$, $\beta$)

Required: $u_0$ (in our notation, $k$)

A single-sampling plan based on variables is defined by a sample size
$N$ and a value $u_0$ of the statistic $u$. If $N$ is fixed, $u_0$ can be determined so
that the operating-characteristic curve for the plan passes through a
specified point $(p_1,\epsilon)$; that is, for a fixed sample size, a plan is chosen for
which the probability of accepting lots of quality $p_1$ is $\epsilon$.

For given $N$, $p_1$, and $\epsilon$, the value $u_0$ that defines the plan is given, in
the notation of Johnson and Welch, by

$$\sqrt{N}\,u_0 = t(N - 1, \sqrt{N}\,U_1, \epsilon) \tag{229}$$

where $U_1$ is the normal deviate exceeded with probability $p_1$. Thus, for
a given sample size $N$, quality (fraction defective) $p_1$, and probability
of acceptance $\epsilon$, the determination of $u_0$ requires the calculation of
$t(f,\delta,\epsilon)$ where $f = N - 1$ and $\delta = \sqrt{N}\,U_1$.

An outline of the method for determining $u_0$ for given values of $N$,
$p_1$, and $\epsilon$ is given below.

Given: $N$ = sample size
       $p_1$ = a proportion defining quality
       $\epsilon$ = desired probability of accepting lots of quality $p_1$

Step 1. Determine $U_1$, the normal deviate exceeded with probability
        $p_1$, from Table 1.1.

Step 2. Calculate
        $f = N - 1$
        $\delta = \sqrt{N}\,U_1$

Step 3. Calculate $t(f,\delta,\epsilon)$ by the method described in Sec. 11.4.5.

Step 4. Calculate $u_0 = t(f,\delta,\epsilon)/\sqrt{N}$.

Problem II. Given: $N$, $u_0$, $\epsilon$ (in our notation, $N$, $k$, $L_p$)

Required: $p$

For every single-sampling plan (defined by $N$ and $u_0$), there is an
operating-characteristic (OC) curve which shows the probability $\epsilon$ of
accepting a lot of quality $p$. From the Johnson and Welch tables we can
compute the values of $p$ corresponding to $\epsilon$ or $1 - \epsilon$ equal to 0.005, 0.01,
0.025, 0.05, 0.1, 0.2, 0.3, 0.4, or 0.5. This requires finding $\delta(f,t_0,\epsilon)$ where
$f = N - 1$ and $t_0 = \sqrt{N}\,u_0$. The value of $U$ corresponding to this value
of $\delta$ is $\delta/\sqrt{N}$. The value of $p$ corresponding to the given $\epsilon$ is then the
probability that a normal deviate will exceed this $U$, which may be found
from Table 1.5.

Step 1. Calculate
        $t_0 = \sqrt{N}\,u_0$
        $f = N - 1$

Step 2. Calculate $\delta(f,t_0,\epsilon)$ by the method described in Sec. 11.4.4.

Step 3. Calculate $U = \delta(f,t_0,\epsilon)/\sqrt{N}$.

Step 4. Obtain the required $p$ by entering Table 1.5 with $K$ equal to
        the value of $U$ found in step 3.

11.4.7. Work Sheets.—Suppose that the sample size is fixed at 30,
and a plan is required for which a lot having 0.1 percent defective will
be accepted with probability 0.99. That is, $N = 30$, $p_1 = 0.001$, and
$\epsilon = 0.99$. Work sheets 1, 2, and 3 (Tables 1.9, 1.10, and 1.11) represent
the calculations necessary to determine $u_0$; work sheet 4 (Table 1.12)
represents those for the OC curve. The numerical entries have been
rounded from calculations originally carried to more decimal places, so
they do not always agree exactly with operations carried out on the
5-decimal-place figures shown.


TABLE 1.9

Work Sheet 1 for Determining $u_0$ (in Our Notation $k$) from Non-central t
Tables

Row number   Operation                    Result                                     Example
   (1)       (2)                          (3)                                        (4)

   (1)       Given                        $p_1$                                       0.001
   (2)       Given                        $\epsilon$                                  0.99
   (3)       Given                        $N$                                        30.
   (4)       Table 1.1                    $U_1$                                       3.09023
   (5)       $\sqrt{(3)}$                 $\sqrt{N}$                                  5.47723
   (6)       (3) - 1                      $f = N - 1$                                29.
   (7)       (4) × (5)                    $\delta = \sqrt{N}\,U_1$                   16.92590
   (8)       Sheet 2, row (14),           $t(f,\delta,\epsilon)$                     12.47223
             last column
   (9)       (8) ÷ (5)                    $u_0 = t(f,\delta,\epsilon)/\sqrt{N}$       2.27711
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TABLE 1.10 

Work Sheet 2 for Determining $u_0$—Computation of $t(f,\delta,\epsilon)$

Row number   Operation              Result                                                      First            Second
                                                                                                approximation    approximation
   (1)       (2)                    (3)                                                         (4)              (5)

   (1)       Sheet 1, line 6        $f$                                                          29
   (2)       Sheet 1, line 7*       $\delta$ or $-\delta$                                       -16.92590
   (3)       Sheet 1, line 2*       $\epsilon$ or $1 - \epsilon$                                  0.01           ←
   (4)       2 × (1)                $2f$                                                         58              ←
   (5)       (2)²                   $\delta^2$                                                  286.48606        ←
   (6)       (5) ÷ (4)              $\delta^2/2f$                                                 4.93941        ←
   (7)       Table 1.1              $K_\epsilon$                                                  2.32635          2.321†
   (8)       (7)²                   $K_\epsilon^2$                                                5.41190          5.38704
   (9)       (8) ÷ (4)              $K_\epsilon^2/2f$                                             0.09331          0.09288
  (10)       1 + (6) - (9)          $1 + \delta^2/2f - K_\epsilon^2/2f$                           5.84611          5.84654
  (11)       $\sqrt{(10)}$          $\sqrt{1 + \delta^2/2f - K_\epsilon^2/2f}$                    2.41787          2.41796
  (12)       (2) + (7) × (11)       $\delta + K_\epsilon\sqrt{1 + \delta^2/2f - K_\epsilon^2/2f}$ -11.30109       -11.31381
  (13)       1 - (9)                $1 - K_\epsilon^2/2f$                                         0.90669          0.90712
  (14)       (12) ÷ (13)            $t = t(f,\delta,\epsilon)$                                  -12.46410        -12.47223
  (15)       $\sqrt{(4)}$           $\sqrt{2f}$                                                   7.61577        ←
  (16)       (14) ÷ (15)            $t/\sqrt{2f}$                                                -1.63662         -1.63768
  (17)       (14)²                  $t^2$                                                       155.35379        155.55660
  (18)       (17) ÷ (4)             $t^2/2f$                                                      2.67851          2.68201
  (19)       1 + (18)               $1 + t^2/2f$                                                  3.67851          3.68201
  (20)       $\sqrt{(19)}$          $\sqrt{1 + t^2/2f}$                                           1.91795          1.91886
  (21)       1 ÷ (20)               $y = 1/\sqrt{1 + t^2/2f}$                                     0.52139          0.52114
  (22)       (16) ÷ (20)            $y' = (t/\sqrt{2f})/\sqrt{1 + t^2/2f}$                        ‡                ‡
  (23)       $\sqrt{(1)}$           $\sqrt{f}$                                                    5.38516        ←
  (24)       12 ÷ (23)              $12/\sqrt{f}$                                                 2.22834        ←
  (25)       Sheet 3, row (17),     $\lambda$                                                     2.321            2.321
             last column

* Since $\epsilon > 0.5$, we change the sign of $\delta$ and replace $\epsilon$ by $1 - \epsilon$.

† The figure in row (7), column (5) is the value of $\lambda_1$ given in row (25), column (4) and is used in
place of $K_\epsilon$ in the succeeding calculations for $t_2$.

‡ No figures are given for $y'$ in this example because $\left|t_0/\sqrt{2f}\right| > 0.75$.
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TABLE 1.11 


Work Sheet 3 for Determining u 0 —Linear Interpolation for X(a,6) 


Row 

number 

Operation 

Result 

First 

approximation 

Second 

approximation 

a) 

(2) 

(3) 

(4) 

(5) 

a) 

Sheet 2, line 21* 

a 

0.52139 

0.52114 

(2) 

Sheet 2, line 24* 

b 

2.2283 

2.2283 

(3) 

Table IV* 

ai 

0.5 

0.5 

(4) 

Table IV* 

a% 

0.6 

0.6 

(5) 

Table IV* 

bi 

3. 

3. 

(6) 

Table IV* 

b* 

2 

2. 

(7) 

(3) - (1) 

oi — a 

Ci —- 

0.4139 

0.2114 

(3) - (4) 

Cll ~ 02 



(8) 

(1) - (4) 

a — a 2 
f 2 - 

0.7861 

0.7886 

(3) - (4) 

Oi ~ Oi 



(9) 

(5) - (2) 

(5) - (6) 

, , bi - b 

l ' “ 6, - 6, 

0.7717 

0.7717 

(10) 

(2) - (6) 

(5) - (6) 

. 6-6, 
dt “ 67 —T, 

0.2283 

0.2283 

UD 

Table IV« 

^11 

2.318 

<_ 

(12) 

Tabic IV* 

A12 

2.324 

♦_ 

(13) 

Table IV* 

x 2l 

2.310 

♦_ 

(14) 

Table IV* 

X 2 2 

2.318 

♦- 

(15) 

(8)(11) + (7)(13) 

C 2 X 11 -|- C 1 X 21 

2.31629 

2.31631 

(16) 

(8)(12) + (7)(14) 

C2X12 + CiX 22 

2.32272 

2.32273 

(17) 

(10)(15) + (9) (16) 

X(a 6 ) 

— <^ 2 ( 02 X 11 + C 1 X 21 ) 

-h d}(C'i\vi 4* C 1 X 22 ) 

2.321 

2.321 


* For the given $\epsilon$, $\lambda$ is a function of $f$ and $t_0$ and is given in Johnson and Welch’s Table IV, for specified
values of $y$ (or $y'$) and $f$ (or $12/\sqrt{f}$). $a$ and $b$ are the values of the arguments (in this case $y$ and $12/\sqrt{f}$,
respectively) for which $\lambda$ is required. $a_1$, $a_2$, $b_1$, and $b_2$ are the values of the arguments for which Table
IV gives $\lambda$ values and between which we interpolate to find $\lambda(a,b)$. $\lambda_{11}$, $\lambda_{12}$, $\lambda_{21}$, and $\lambda_{22}$ are the tabular
values corresponding to $a_1$, $a_2$, $b_1$, and $b_2$. The following scheme explains the notation:

          $b_1$            $b_2$
$a_1$     $\lambda_{11}$   $\lambda_{12}$
$a_2$     $\lambda_{21}$   $\lambda_{22}$
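
The interpolation of Work Sheet 3 can be sketched in a few lines of code. The function below is only an illustration: its name and argument order are assumptions, and the tabular values are simply those entered in rows (11) through (14) of the sheet rather than a reading of Johnson and Welch's Table IV itself.

```python
def interpolate_lambda(a, b, a1, a2, b1, b2, lam11, lam12, lam21, lam22):
    """Two-way linear interpolation, following rows (7)-(17) of Work Sheet 3.

    lam_ij is the tabular value at arguments (a_i, b_j).
    """
    c1 = (a1 - a) / (a1 - a2)          # row (7)
    c2 = (a - a2) / (a1 - a2)          # row (8)
    d1 = (b1 - b) / (b1 - b2)          # row (9)
    d2 = (b - b2) / (b1 - b2)          # row (10)
    at_b1 = c2 * lam11 + c1 * lam21    # row (15): interpolate in a at b1
    at_b2 = c2 * lam12 + c1 * lam22    # row (16): interpolate in a at b2
    return d2 * at_b1 + d1 * at_b2     # row (17): interpolate in b


# Second-approximation column of Work Sheet 3.
lam = interpolate_lambda(0.52114, 2.2283, 0.5, 0.6, 3.0, 2.0,
                         2.318, 2.324, 2.310, 2.318)
print(round(lam, 3))
```

The weights in each direction sum to one, so the result always lies between the smallest and largest of the four tabular values.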



The computation of the OC curve for the above plan is illustrated by
finding the values of $p$ for which $L_p$ is 0.2 and 0.7. The figures given in
Table 1.12 represent the calculations necessary to determine the values of
$p$ corresponding with these two values of $L_p$. Additional columns would
be required for additional points on the OC curve.
TABLE 1.12

Work Sheet 4 for Operating-characteristic Curve

Row number | Operation                                                  | Result                                           | First point | Second point
(1)        | (2)                                                        | (3)                                              | (4)         | (5)
(1)        | Given                                                      | $L_p$                                            | 0.2         | 0.7
(2)        | Smaller of $L_p$ and $1 - L_p$                             | $\epsilon$                                       | 0.2         | 0.3
(3)        | Sheet 3, line 1, last column                               | $a$                                              | 0.52114     | ←
(4)        | Sheet 3, line 2, last column                               | $b$                                              | 2.2283      | ←
(5)        | Sheet 3, row 3*                                            | $a_1$                                            | 0.6         | 0.5
(6)        | Sheet 3, row 4*                                            | $a_2$                                            | 0.5         | 0.6
(7)        | Sheet 3, row 5                                             | $b_1$                                            | 3.          | ←
(8)        | Sheet 3, row 6                                             | $b_2$                                            | 2.          | ←
(9)        | [(5) - (3)] / [(5) - (6)]                                  | $c_1 = (a_1 - a)/(a_1 - a_2)$                    | 0.7886      | 0.2114
(10)       | [(3) - (6)] / [(5) - (6)]                                  | $c_2 = (a - a_2)/(a_1 - a_2)$                    | 0.2114      | 0.7886
(11)       | [(7) - (4)] / [(7) - (8)]                                  | $d_1 = (b_1 - b)/(b_1 - b_2)$                    | 0.7717      | ←
(12)       | [(4) - (8)] / [(7) - (8)]                                  | $d_2 = (b - b_2)/(b_1 - b_2)$                    | 0.2283      | ←
(13)       | Table IV†                                                  | $\lambda_{11}$                                   | 0.9125      | 0.4318
(14)       | Table IV†                                                  | $\lambda_{12}$                                   | 0.8899      | 0.4632
(15)       | Table IV†                                                  | $\lambda_{21}$                                   | 0.9192      | 0.4408
(16)       | Table IV†                                                  | $\lambda_{22}$                                   | 0.8945      | 0.4691
(17)       | (10)(13) + (9)(15)                                         | $c_2\lambda_{11} + c_1\lambda_{21}$              | 0.91778     | 0.43370
(18)       | (10)(14) + (9)(16)                                         | $c_2\lambda_{12} + c_1\lambda_{22}$              | 0.89353     | 0.46445
(19)       | (12)(17) + (11)(18)                                        | $\lambda(a,b)$                                   | 0.8991      | 0.4574
(20)       | Sheet 2, row 14, last column‡                              | $t_0$                                            | 12.47223    | -12.47223
(21)       | Sheet 2, row 20, last column                               | $\sqrt{1 + t_0^2/(2f)}$                          | 1.91886     | ←
(22)       | (20) - (19) × (21)                                         | $\delta = t_0 - \lambda\sqrt{1 + t_0^2/(2f)}$    | 10.74699    | -13.34992
(23)       | (22) if $\epsilon = L_p$; -(22) if $\epsilon = 1 - L_p$    | $\delta$                                         | 10.74699    | 13.34992
(24)       | Sheet 1, row 5                                             | $\sqrt{N}$                                       | 5.47723     | ←
(25)       | (23) ÷ (24)                                                | $U = \delta/\sqrt{N}$                            | 1.96212     | 2.43735
(26)       | Table 1.5                                                  | $p$                                              | 0.0249      | 0.0074

In row (19), $\lambda(a,b) = d_2(c_2\lambda_{11} + c_1\lambda_{21}) + d_1(c_2\lambda_{12} + c_1\lambda_{22})$.
An arrow (←) means that the entry is the same as in the column to its left.

* The interchange of $a_1$ and $a_2$ in column (4) is made for consistency with the interpolation scheme of
sheet 3.

† Use the panel of Table IV that corresponds with the value of $\epsilon$ given in row (2). Note that the
portion of Table IV where t is positive is used for column (4) and the portion where t is negative is used
for column (5).

‡ If only $N$ and $u_0$ are given and no preliminary calculations are available from work sheets for
determining $u_0$ by the exact method, then sheet 4 must be supplemented by a sheet consisting of rows
(14) through (34) of sheet 3, with $t_0$ taken as $\sqrt{N}u_0$ and $f$ taken as $N - 1$.
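
Rows (20) through (26) of Work Sheet 4 can be sketched as follows. This is an illustration under stated assumptions: the function name is hypothetical, the values $f = 29$ and $N = 30$ are inferred from the work-sheet entries $\sqrt{f} = 5.38516$ and $\sqrt{N} = 5.47723$, and the normal tail area is computed with `math.erfc` in place of a lookup in Table 1.5.

```python
import math


def oc_point(lam, t0, f, n, eps_is_lp):
    """Return (delta, U, p) for one point on the OC curve.

    lam       -- interpolated lambda(a, b), row (19)
    t0        -- signed level of non-central t, row (20)
    f, n      -- degrees of freedom and sample size
    eps_is_lp -- True if epsilon = L_p, False if epsilon = 1 - L_p
    """
    root = math.sqrt(1.0 + t0 * t0 / (2.0 * f))   # row (21)
    delta = t0 - lam * root                        # row (22)
    if not eps_is_lp:
        delta = -delta                             # row (23)
    u = delta / math.sqrt(n)                       # row (25)
    p = 0.5 * math.erfc(u / math.sqrt(2.0))        # row (26): upper tail of the normal
    return delta, u, p


first = oc_point(0.8991, 12.47223, 29, 30, eps_is_lp=True)     # L_p = 0.2
second = oc_point(0.4574, -12.47223, 29, 30, eps_is_lp=False)  # L_p = 0.7
print([round(v, 4) for v in first])
print([round(v, 4) for v in second])
```

Under these assumptions the computed qualities agree with the tabulated 0.0249 and 0.0074 to the four decimal places shown in row (26).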
11.5. Construction of Table 1.2

Problem. Given: $p_1$, $p_2$, $\alpha$, and $\beta$
Required: $N$ and $k$

A single-sampling plan based on variables is defined by a sample
size $N$ and a value $k$, the multiple of the standard deviation in the criterion
$\bar{x} + ks < U$ for accepting the lot, where $U$ is the value above which the
characteristic being inspected is regarded as defective. If no restrictions
are imposed on sample size, a plan can be constructed for which the
operating-characteristic curve will pass through two specified points. In
practice, the first point often corresponds with a lot quality $p_1$ (AQL) for
which it is desirable that the probability of rejection should be only $\alpha$
(producer’s risk), and the second point corresponds with a lot quality
$p_2$ (LTPD) for which it is desirable that the probability of acceptance
should be only $\beta$ (consumer’s risk).

Consider the family of all OC curves that pass through the point
$(p_1, 1 - \alpha)$ but which have differing values of $N$. If $N$ is small these
curves decline gradually as $p$ increases from $p_1$, and if $N$ is large they
decline sharply. There will be one OC curve among the family that
passes closer than any other curve to the point $(p_2, \beta)$. Since $N$ takes
only integral values, there usually will be no curve that passes exactly
through $(p_2, \beta)$. A curve could, however, be made to pass exactly through
$(p_2, \beta)$ if it were allowed to deviate from $(p_1, 1 - \alpha)$. Table 1.2 has been
constructed in such a way that the OC curves generally pass not exactly
through either $(p_1, 1 - \alpha)$ or $(p_2, \beta)$ but very close to both. The distance
of the OC curve from the intended two points has been measured
horizontally, that is, by the difference between the actual and intended
qualities for which the probabilities of acceptance are $1 - \alpha$ and $\beta$.

The values of $N$ and $k$ in Table 1.2 were obtained in three stages:
(1) obtaining a first approximation to $N$ and $k$; (2) determining the degree
to which the preliminary values of $N$ and $k$ satisfy the requirement that
the OC curve pass through $(p_1, 1 - \alpha)$ and $(p_2, \beta)$; and (3) adjusting $N$
or $k$ to improve the approximation.

Step 1. From $p_1$, $p_2$, $\alpha$, and $\beta$, determine $K_1$, $K_2$, $K_\alpha$, and $K_\beta$, the
normal deviates exceeded with probabilities $p_1$, $p_2$, $\alpha$, and $\beta$,
respectively, as given by Table 1.1.

Step 2. Calculate the first approximation to $k$ from

\[ k = \frac{K_\alpha K_2 + K_\beta K_1}{K_\alpha + K_\beta} \]

Step 3. Calculate an approximate sample size $N$ from

\[ N = \frac{k^2 + 2}{2} \left( \frac{K_\alpha + K_\beta}{K_1 - K_2} \right)^2 \]

Step 4. By the method given in Sec. 11.4 for determining the quality
$p$ corresponding to a specified probability of acceptance $L_p$,
determine $p_{1-\alpha}$ and $p_\beta$ where

$p_{1-\alpha}$ = the quality for which the probability of acceptance
is $1 - \alpha$

$p_\beta$ = the quality for which the probability of acceptance
is $\beta$

If $p_{1-\alpha}$ and $p_\beta$ are sufficiently close to $p_1$ and $p_2$, respectively,
then the values of $k$ and $N$ given by steps 2 and 3 define a
satisfactory sampling plan.

Step 5. If the discrepancy between $p_{1-\alpha}$ and $p_1$ or that between $p_\beta$ and
$p_2$ is too large, determine a better plan by adjusting either $N$
or $k$ according to the following rules:

a. If $p_{1-\alpha}$ is greater than $p_1$ and $p_\beta$ is less than $p_2$, take the
next lower integer for $N$ and the same $k$, and repeat step 4.

b. If $p_{1-\alpha}$ is less than $p_1$ and $p_\beta$ is greater than $p_2$, take the
next higher integer for $N$ and the same $k$, and repeat
step 4.

c. If $p_{1-\alpha}$ is less than $p_1$ and $p_\beta$ is less than $p_2$, use the same
$N$ and a smaller value of $k$, and repeat step 4.

d. If $p_{1-\alpha}$ is greater than $p_1$ and $p_\beta$ is greater than $p_2$, use
the same $N$ and a larger value of $k$, and repeat step 4.

These rules can be applied regardless of the method used for finding
the first approximation to $k$ and $N$. When these are found by steps 2 and
3, however, rule d will usually be applicable.

Calculate the better approximation for $k$ mentioned in (c) and (d)
from

\[ k = \frac{\lambda_1 K_2 + \lambda_2 K_1}{\lambda_1 + \lambda_2} \]

where

$\lambda_1$ = value of $\lambda$ computed in determining $p_{1-\alpha}$ in step 4
$\lambda_2$ = value of $\lambda$ computed in determining $p_\beta$ in step 4

Step 6. For the required $N$ and $k$, take the pair of values for which
$p_{1-\alpha}$ and $p_\beta$ are close enough, or closest, to the given values of
$p_1$ and $p_2$, respectively.
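
Steps 1 through 3 and the choice among rules a through d in step 5 can be sketched as below. This is a minimal illustration, not the procedure actually used to build Table 1.2: the function names are hypothetical, and the normal deviates are taken from `statistics.NormalDist` instead of Table 1.1. The evaluation of $p_{1-\alpha}$ and $p_\beta$ in step 4 is not reproduced; the rule selector only takes their values as inputs.

```python
import math
from statistics import NormalDist


def normal_deviate(prob):
    """Normal deviate exceeded with probability `prob` (stands in for Table 1.1)."""
    return NormalDist().inv_cdf(1.0 - prob)


def first_approximation(p1, p2, alpha, beta):
    """Steps 1-3: first approximations to k and to the (non-integral) sample size."""
    k1, k2 = normal_deviate(p1), normal_deviate(p2)
    k_alpha, k_beta = normal_deviate(alpha), normal_deviate(beta)
    k = (k_alpha * k2 + k_beta * k1) / (k_alpha + k_beta)
    n = (k * k + 2.0) / 2.0 * ((k_alpha + k_beta) / (k1 - k2)) ** 2
    return k, n


def adjustment_rule(p1, p2, p_1_minus_alpha, p_beta):
    """Step 5: pick the adjustment called for by rules a-d."""
    if p_1_minus_alpha > p1 and p_beta < p2:
        return "a: next lower N, same k"
    if p_1_minus_alpha < p1 and p_beta > p2:
        return "b: next higher N, same k"
    if p_1_minus_alpha < p1 and p_beta < p2:
        return "c: same N, smaller k"
    if p_1_minus_alpha > p1 and p_beta > p2:
        return "d: same N, larger k"
    return "no rule applies"


k, n = first_approximation(0.15, 0.30, 0.01, 0.02)
print(round(k, 4), math.ceil(n))
print(adjustment_rule(0.15, 0.30, 0.152, 0.304))
```

The inputs $p_1 = 0.15$, $p_2 = 0.30$, $\alpha = 0.01$, $\beta = 0.02$ are those of the example in Sec. 11.6, where the single-sampling value of $k$ is quoted as 0.7645 with $N = 95$; the values 0.152 and 0.304 passed to the rule selector are invented purely to exercise rule d.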

Table 1.2, which gives values of $N$ and $k$ for various combinations of
$p_1$ and $p_2$ when $\alpha = 0.05$ and $\beta = 0.10$, was constructed by essentially
the method described above. For each plan in Table 1.2, except that
for $p_1 = 0.05$ and $p_2 = 0.40$, the true values of $p_1$ and $p_2$ agree exactly
with the intended values when rounded to the number of decimal places
with which the intended $p_1$ and $p_2$ are given in the table; in many cases
the agreement is much better. Minor improvements could, no doubt, be
made in many cases.
11.6. Sequential Analysis of Proportion Defective by Variables*

The fact that

\[ z = \bar{x} + ks \tag{230} \]

can be taken as approximately normally distributed, with mean and
standard deviation approximately

\[ Z = \mu + k\sigma \tag{231} \]

and

\[ \sigma(N) = \sigma F(N) \tag{232} \]

where

\[ F(N) = \sqrt{\frac{1}{N} + \frac{k^2}{2(N-1)}} \tag{233} \]

may be made the basis of a sequential procedure for controlling percent
defective by means of variables.

For a lot of acceptable quality, that is, one in which the mean of $z$
is $Z_1$ as defined by (155), the probability density of $z$ is given by

\[ P(z|Z_1) = \frac{1}{\sigma(N)\sqrt{2\pi}} \exp\left\{ -\frac{(z - Z_1)^2}{2[\sigma(N)]^2} \right\} \tag{234} \]

Similarly, for a lot of unacceptable quality, that is, one in which the mean
of $z$ is $Z_2$ as defined by (156), the probability density of $z$ is

\[ P(z|Z_2) = \frac{1}{\sigma(N)\sqrt{2\pi}} \exp\left\{ -\frac{(z - Z_2)^2}{2[\sigma(N)]^2} \right\} \tag{235} \]

A sequential test is made by computing $\lambda$ after each observation,
where

\[ \lambda = \log_e \frac{P(z|Z_2)}{P(z|Z_1)} \tag{236} \]

and rejecting the lot if

\[ \lambda > a \tag{237} \]

accepting it if

\[ \lambda < -b \tag{238} \]

or taking another observation if

\[ -b < \lambda < a \tag{239} \]

where

\[ a = \log_e \frac{1 - \beta}{\alpha} \tag{240} \]

\[ b = \log_e \frac{1 - \alpha}{\beta} \tag{241} \]

$\alpha$ being the probability of rejecting a lot of quality $Z_1$ and $\beta$ the
probability of accepting a lot of quality $Z_2$. Each calculation of $\lambda$ is based on
all observations.

* See Chap. 17 for a brief account of sequential analysis and for references to the
fundamental work on this subject, the theory of which was developed in the Statistical
Research Group by A. Wald. An exact treatment of proportion defective problems by
sequential analysis of variables involves the ratio of densities of two non-central
t distributions. Henry Goldberg has shown that this may be expressed in terms of a
confluent hypergeometric function, tables of which were prepared for the Statistical
Research Group by the New York Mathematical Tables Project in a form suitable
for use in sequential analysis. There is in preparation an additional section for
Sequential Analysis that will include these tables and an explanation of their use.
The present section simply outlines a sequential procedure based on the
approximations used elsewhere in this chapter for single sampling.

To obtain $\lambda$, we take the logarithm of (234) and subtract it from the
logarithm of (235), which yields

\[ \lambda = \frac{1}{2[\sigma(N)]^2} \left[ (z - Z_1)^2 - (z - Z_2)^2 \right] = \frac{Z_2 - Z_1}{[\sigma(N)]^2} \left( z - \frac{Z_1 + Z_2}{2} \right) \tag{242} \]

Substituting for $Z_1$ and $Z_2$ from (155) and (156) and for $\sigma(N)$ from (232),
and rearranging, gives

\[ \lambda = \frac{K_1 - K_2}{\sigma[F(N)]^2} \left[ z - U - \sigma\left( k - \frac{K_1 + K_2}{2} \right) \right] \tag{243} \]

which may be rewritten as

\[ z = U + \sigma \left[ \frac{\lambda[F(N)]^2}{K_1 - K_2} - \frac{K_1 + K_2}{2} + k \right] \tag{244} \]

This will be independent of $\sigma$ if

\[ k = \frac{K_1 + K_2}{2} - \frac{\lambda[F(N)]^2}{K_1 - K_2} \tag{245} \]

On substituting for $F(N)$ from (233), there results a quadratic in $k$ for
which the solution is

\[ k = \frac{\sqrt{1 - AB} - 1}{A} \tag{246} \]

where

\[ A = \frac{\lambda}{(N - 1)(K_1 - K_2)} \tag{247} \]

and

\[ B = \frac{2\lambda}{N(K_1 - K_2)} - (K_1 + K_2) \tag{248} \]
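
The step from (245) to (246) can be written out explicitly. The lines below are a reconstruction from (233) and (245), offered as a check on the algebra and not as a quotation from the source.

```latex
\begin{align*}
k &= \frac{K_1 + K_2}{2}
     - \frac{\lambda}{K_1 - K_2}\left[\frac{1}{N} + \frac{k^2}{2(N-1)}\right] \\[4pt]
0 &= \frac{\lambda}{(N-1)(K_1 - K_2)}\,k^2 + 2k
     + \left[\frac{2\lambda}{N(K_1 - K_2)} - (K_1 + K_2)\right] \\[4pt]
  &= A k^2 + 2k + B \\[4pt]
k &= \frac{-1 \pm \sqrt{1 - AB}}{A}
\end{align*}
```

The root with the positive sign is the one given in (246); it is the root that remains finite, tending to $-B/2$, as $A$ tends to zero.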

For each value of $N$, two values of $k$ are obtained from (246), the
first, $k_R$, by replacing $\lambda$ in (247) and (248) by $a$, and the second, $k_A$, by
replacing $\lambda$ by $-b$, where $a$ and $b$ are given by (240) and (241).* Then
after each observation both

\[ z_R = \bar{x} + k_R s \tag{249} \]

and

\[ z_A = \bar{x} + k_A s \tag{250} \]

are computed. If $z_R > U$ the lot is rejected. If $z_A < U$ the lot is
accepted. If neither condition is fulfilled, another item is inspected,
new values of $\bar{x}$, $s$, $k_R$, and $k_A$ are computed from all the observations, and
new values of $z_R$ and $z_A$ are compared with $U$.

An alternative computational procedure is to compute $k_0$ after each
observation, where

\[ k_0 = \frac{U - \bar{x}}{s} \tag{251} \]

then if $k_0 > k_A$ the lot is accepted, if $k_0 < k_R$ the lot is rejected, and if
$k_A > k_0 > k_R$ another item is inspected.

The values of $k_A$ and $k_R$ depend upon $N$, $p_1$, $p_2$, $\alpha$, and $\beta$. In any one
problem all of these but $N$ are fixed, so a table showing $k_A$ and $k_R$ for various
values of $N$ can be prepared in advance. If $p_1 = 0.15$, $p_2 = 0.30$, $\alpha = 0.01$,
$\beta = 0.02$, and $N = 95$, the values of $k_R$ and $k_A$ are 0.6651 and 0.8930,
respectively.† For a single-sampling plan, the corresponding value of
$k$ was found in Sec. 4.2.2 to be 0.7645. For the sequential plan, if no
decision had been reached from the first 94 observations, rejection would
occur if for the first 95 observations $\bar{x} + 0.6651s > U$ and acceptance
would occur if $\bar{x} + 0.8930s < U$; if neither condition were fulfilled, another
observation would be made. From the data in Sec. 7.2.2, where $\bar{x}
= 30.232$, $s = 5.000$, and $U = 35$, we find $z_R = 33.56$, which, being less
than 35, does not call for rejection; and $z_A = 34.70$, which, being less
than 35, calls for acceptance.
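
The calculation of $k_R$ and $k_A$ from (240), (241), and (246) through (248), and the decision rule of (249) and (250), can be sketched as follows. The function names are hypothetical, and the normal deviates $K_1$ and $K_2$ are taken from `statistics.NormalDist` instead of a printed table. With the inputs of the example above the sketch gives values that round to the 0.6651 and 0.8930 quoted in the text.

```python
import math
from statistics import NormalDist


def sequential_limits(p1, p2, alpha, beta, n):
    """Return (k_R, k_A) for a sample of n observations."""
    nd = NormalDist()
    k1 = nd.inv_cdf(1.0 - p1)              # normal deviate exceeded with probability p1
    k2 = nd.inv_cdf(1.0 - p2)              # normal deviate exceeded with probability p2
    a = math.log((1.0 - beta) / alpha)     # (240)
    b = math.log((1.0 - alpha) / beta)     # (241)

    def solve(lam):
        big_a = lam / ((n - 1) * (k1 - k2))              # (247)
        big_b = 2.0 * lam / (n * (k1 - k2)) - (k1 + k2)  # (248)
        return (math.sqrt(1.0 - big_a * big_b) - 1.0) / big_a  # (246)

    return solve(a), solve(-b)


def decide(xbar, s, upper, k_r, k_a):
    """Apply (249) and (250) to the current mean and standard deviation."""
    if xbar + k_r * s > upper:
        return "reject"
    if xbar + k_a * s < upper:
        return "accept"
    return "continue"


k_r, k_a = sequential_limits(0.15, 0.30, 0.01, 0.02, 95)
print(round(k_r, 4), round(k_a, 4))
print(decide(30.232, 5.000, 35.0, k_r, k_a))
```

Because $k_R$ and $k_A$ depend on $N$, a practical routine would call `sequential_limits` again (or read a table prepared in advance) after every new observation.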


* Tables of $a$ and $b$ in terms of $\alpha$ and $\beta$ are given in Statistical Research Group,
Sequential Analysis, Tables 4.2, 5.1, 6.1, and 7.31. These give $a$ and $b$ to 3 decimal
places for values of $\alpha$ and $\beta$ of 0.001, 0.01 to 0.05 by steps of 0.01, 0.05 to 0.20 by steps
of 0.05, and 0.20 to 0.40 by steps of 0.10.

† From the unpublished tables referred to in the footnote at the beginning of Sec.
11.6, which are based on the likelihood ratio computed from two non-central t
distributions, the corresponding values of $k_R$ and $k_A$ are found to be 0.6630 and 0.8893. For
$k_R = 0.6651$ and $k_A = 0.8930$, the true values of $\alpha$ and $\beta$ are, respectively, 0.0110 and
0.0177, which differ by 9.6 and 11.7 percent from the intended values, 0.01 and 0.02.
Thus, these two OC points lie below (to the left of) those intended, whereas with
single sampling the true points generally lie above (to the right of) those intended
(see Table 1.8).


12. NOTATION

The arrangement is alphabetical, the sequence of listing under each
letter being as follows: first Roman, then Greek letters; within these
groups, first capital, then small letters; within these groups, first those
without, then those with, primes, bars, or carets; within these groups, first
those without subscripts, next those with literal subscripts (following
the same sequence as for letters), and finally those with numerical
subscripts (in numerical order). Certain symbols, notably $a$ and $b$, are
used in a variety of different meanings that occur only locally; in such
cases the listing is in order of first appearance. Section 11.4 employs the
notation of Johnson and Welch, which differs from (and in some respects,
for example, $U$ and $L$, contradicts) that used elsewhere in the chapter.


Symbol 

See

Meaning 

A 

p. 37 

An adjustment constant subtracted from 
each observation 

A 

p. 61 

Critical value of z for accepting or rejecting 
a lot 

A 

p. 84 

equations (246), (247) 

A function of $\lambda$, $N$, $K_1$, and $K_2$ that appears
in formulas for the sequential procedure 

AQL 

pp. 11, 45 

Acceptable quality level (see p x ) 

a 

pp. 19, 20 
equations (21), (22) 

Function of N and K a that appears in the 
formula for k when N is fixed 

a 

p. 35 

equations (55), (56) 

Coefficient of quadratic term in condition, 
quadratic in X , for acceptance in terms 
of Y 

a 

p. 58 

equation (118) 

Multiple of $\sigma$ in formula for mean of
distribution of $s$

a 

p. 61 

equations (136), (137) 

Function of $K_1$, $K_2$, $K_\alpha$, and $K_\beta$ that
appears in the formula for $N$

a 

pp. 83, 84 
equation (240) 

A critical value of X in a sequential test on 
the basis of which the lot is rejected or 
another observation is taken 

$\alpha$

p. 16 

Producer’s risk. Probability that a lot of 
acceptable quality (pi or AQL) will be 
rejected 

B 

p. 84 

equations (246), (248) 

A function of $\lambda$, $N$, $K_1$, and $K_2$ that appears
in formulas for the sequential procedure 

b 

pp. 19, 20 
equations (21), (23) 

Function of $N$, $K_1$, and $K_\alpha$ that appears in
the formula for k when N is fixed 

b 

p. 35 

equations (55), (57) 

Negative of coefficient of linear term in 
condition, quadratic in X, for acceptance 
in terms of Y 

b 

p. 58 

equation (119) 

Multiple of $\sigma/\sqrt{2(N - 1)}$ in the formula
for standard deviation of distribution of $s$

b 

p. 61 

equations (136), (138) 

Function of k and a that appears in the 
formula for N 

b 

pp. 83, 84 
equation (241) 

A critical value of X in a sequential test on 
the basis of which the lot is accepted or 
another observation is taken 

$b_M$

pp. 53, 54
equations (100), (103)

Value of $b$ obtained from equation (57) by
replacing $U$ by $M$

$b_U$

pp. 53, 54
equations (100), (103)

Same as $b$ given by equation (57); subscript
$U$ used to distinguish $b_U$ from $b_M$

$b'$

p. 41
equations (78), (79)

Negative of coefficient of linear term in
condition, linear in $X$, for acceptance in
terms of $Y$

$\beta$

p. 16

Consumer's risk. Probability that a lot of
unacceptable quality ($p_2$ or LTPD) will
be accepted

$c$

p. 35
equations (55), (58)

Constant term in condition, quadratic in $X$,
for acceptance in terms of $Y$

$c_M$

pp. 53, 54
equation (104)

Value of $c$ obtained from equation (58) by
replacing $U$ by $M$

$c_U$

pp. 53, 54
equations (100), (104)

Same as $c$ given by equation (58); subscript
$U$ used to distinguish $c_U$ from $c_M$

$c'$

p. 41
equations (78), (80)

Constant term in condition, linear in $X$, for
acceptance in terms of $Y$

$\gamma$

p. 43

A proportion ($0 < \gamma < 1$) denoting a
confidence coefficient

$\gamma_c$

p. 44

Confidence coefficient used in judging
whether to continue normal inspection

$\gamma_r$

p. 44

Confidence coefficient used for judging
whether to resume normal inspection

$\delta$

p. 73

Johnson and Welch notation for a constant
appearing in the numerator of non-central
$t$; $\delta = \sqrt{N}U$

$\delta(f,t_0,\epsilon)$

p. 73

Sec. 11.4.4

Johnson and Welch notation for the value
of $\delta$ that makes $P(f,\delta,t_0) = \epsilon$

$\epsilon$

p. 17

equation (9)

A probability ($0 < \epsilon < 1$); probability
that a normal deviate will exceed $K_\epsilon$

$\epsilon$

p. 73

Johnson and Welch notation for probability
of acceptance

F(N) 

p. 61 

Any function of $N$ that is independent of $\sigma$

f 

p. 73 

Degrees of freedom of $\chi^2$

h 

p. 42 

Number of lots from which samples are 
taken and combined into one large sample 
for the purpose of computing the process 
average 

K 

p. 9 

$(U - \mu)/\sigma$. That is, the difference between
the upper limit and the population
mean, as a multiple of the population
standard deviation

$K_\alpha$

p. 17

Normal deviate exceeded with probability $\alpha$

$K_\beta$

p. 17

Normal deviate exceeded with probability $\beta$

$K_\epsilon$

p. 17

Normal deviate exceeded with probability $\epsilon$

$K_0$

p. 21

Normal deviate exceeded with probability $p_0$

$K_1$

p. 17

Normal deviate exceeded with probability $p_1$

$K_2$

p. 17

Normal deviate exceeded with probability $p_2$

k 

p. 12 

Coefficient of the sample standard deviation
in the acceptance criterion
$\bar{x} + ks < U$;

also, acceptable level of statistic $k_0$

k A 

p. 84 

Acceptance level for k 0 in a sequential test 

k 0 

p. 43 

$(U - \bar{x})/s$. That is, the observed difference
between $U$ and the sample mean in
units of the sample standard deviation;
sample estimate of $K$

k R 

p. 84 

Rejection level for k 0 in a sequential test 

$k'$

p. 48

$\sqrt{n}$ times the coefficient of the sample
standard deviation in the acceptance
criterion, $\bar{x} + (k'/\sqrt{n})s < U'$, used when
items are eventually assembled in sets of $n$

ty 

p. 47 

Upper confidence limit for $K$, based on a
confidence coefficient of y 

ky e 

p. 45 

Lower and upper limits, respectively,
between which a computed value of k 0 
leads under normal inspection to con¬ 
tinuing normal inspection, based on a 
confidence coefficient of y, 

hy ry tiy. 

p. 45 

Lower and upper limits, respectively,
between which a computed value of k 0 
leads under reduced or tightened inspec¬ 
tion to resuming normal inspection, based 
on a confidence coefficient y r 

L 

p. 72 

Johnson and Welch notation for the upper 
limit of a variable, above which an item is 
defective; corresponds with our U 

LTPD 

pp. 22, 45 

Lot tolerance percent defective (see p 2 ) 

L p 

p. 15 

Ordinate of an OC curve at abscissa p. 
The probability that a lot of fraction 
defective p will, if submitted to a given 
acceptance test, be passed 


p. 16 

Value of an OC curve at $\mu$, $\sigma$. The probability
that a lot of mean $\mu$ and standard
deviation $\sigma$ will, if submitted to a given
acceptance test, be passed

In 

p. 61 

Probability that a lot having mean Z\ will, 
if submitted to a given acceptance test, 
be passed 

U 

p. 62 

Probability that a lot having mean Z 2 will, 
if submitted to a given acceptance test, be 
passed 

$\lambda$

p. 83

Natural logarithm of the ratio of two
probability densities, computed after each
observation in a sequential test

$\lambda = \lambda(f,t_0,\epsilon)$

p. 74

equations (223), (224)

Johnson and Welch notation for a function
of $f$, $t_0$, and $\epsilon$ used in the calculation of
$\delta(f,t_0,\epsilon)$ and $t(f,\delta,\epsilon)$

$\lambda_1$

p. 82

Value of $\lambda$ computed in determining $p_{1-\alpha}$

$\lambda_2$

p. 82

Value of $\lambda$ computed in determining $p_\beta$

M 

p. 13 

Minimum value of a variable, below which 
an item is defective 

$\mu$

p. 9

Population mean of a normally distributed
variable

t 

P 

p. 41 

Best estimate of £ available before the 
sample is inspected 

N 

p. 12 

Number of items in a sample 

Na 

p. 26 

Sample size for single-sampling plan for 
percent defective based on attributes

Ni 

p. 43 

Number of items in the ith sample of a 
group of h samples 

N m 

p. 27 

Sample size for single-sampling plan for 
percent defective based on the mean when 
the standard deviation is known 

N. 

p. 28 

Sample size for single-sampling plan for 
percent defective based on the standard 
deviation when the mean is known 

N t 

p. 27 

Sample size for single-sampling plan for 
percent defective based on the mean and 
standard deviation when both are un¬ 
known but estimated from sample 

N' 

p. 48 

Number of sets of n items in a sample 

n 

p. 48 

Number of items constituting a set which 
is judged defective or not on the basis of 
the average for the set 

n 

p. 50 

Number of variables upon which the 
quality of an item depends 

OC 

Sec. 3.4 

Sec. 6 

Operating characteristics. The probability 
that a given acceptance test will pass lots 
of various qualities if submitted 

PA 

p. 42 

Process average percent defective 


$P(f,\delta,t_0)$

p. 73

Johnson and Welch notation for the probability
that $t$ exceeds a given level $t_0$
for given $f$ and $\delta$

$P(z|Z_1)$

p. 83

equation (234)

Probability density of $z$ for lot in which the
mean of $z$ is $Z_1$

$P(z|Z_2)$

p. 83

equation (235)

Probability density of $z$ for lot in which the
mean of $z$ is $Z_2$

P 

p. 15 

Fraction defective (quality) of a lot 

$p_{1-\alpha}$

p. 82

The quality for which the probability of
acceptance is $1 - \alpha$ with a given sampling
plan

$p_\beta$

p. 82

The quality for which the probability of
acceptance is $\beta$ with a given sampling plan

Po 

pp. 20, 54 

Indifference quality, or lot fraction defec¬ 
tive for which acceptance and rejection 
are equally probable

Pi 

p. 16 

Lot fraction defective for which it is desir¬ 
able that the probability of rejection be 
at most a ; (see AQL) 

P* 

p. 16 

Lot fraction defective for which it is desir¬ 
able that the probability of acceptance be 
at most $\beta$; (see LTPD)

P' 

pp. 30, 33 

Lot fraction defective for which the prob¬ 
ability of acceptance {L p >) is 1 — L p 

Two parts ( p[ < p") into which p 0 is 
arbitrarily divided, that is, p r 0 + p" = p 0 

Po) Po 

pp. 55, 64 

p 

p. 43 

Point estimate, corresponding with a con¬ 
fidence coefficient of zero, of the process 
average percent defective. Probability 
that a normal deviate will exceed k Q 

K py 

p. 43 

Lower and upper confidence limits, respec¬ 
tively, for fraction defective where y is the 
confidence coefficient 

S 

pp. 30, 32 

$\sqrt{1/N + k^2/[2(N - 1)]}$. Quantity used in
computations for OC curve

8 

p. 12 

equations (5), (6) 

Sample estimate of the standard deviation 
of a variable 

8{ 

a - i, • • • , h) 

p. 43 

Sample estimate of the standard deviation 
of a variable for the ith sample of a group 
of h samples 


p. 12 

equations (5), (6) 

Sample estimate of the variance (squared 
standard deviation) of a variable 

2 

p. 12 

equations (4), (5) 

Summation over a sample. The variable 
following 2 is to be summed over the 
sample 

$\sigma$

p. 9

Population standard deviation of a normally
distributed variable

$\sigma(N)$

p. 61

equation (139)

Population standard deviation of the normally
distributed variable $z$, a function of
the sample size $N$

$t$

p. 72

A variable having the non-central $t$ distribution;
in this chapter, used to designate
the variable $\sqrt{N}u = (\sqrt{N}/s)(L - \bar{x})$

t(fM 

p. 73 

Sec. 11.4.5 

Johnson and Welch notation for that 
value of t 0 for which the probability 
P(/,«,*o) takes a specified value e 

to 

p. 73

Johnson and Welch notation for a given 
level of non-central t 

t 1, tif 

pp. 75, 76 

Successive approximations obtained in the 
iterative method for calculating f (/,$,«) 

V 

p. 9 

Upper limit of a variable, above which an 
item is defective 

V 

p. 72 

Johnson and Welch notation for standard¬ 
ized normal deviate exceeded with prob¬ 
ability p\ corresponds with our K 

Vy 

p. 76 

Johnson and Welch notation for the normal 
deviate exceeded with probability pi 

U' 

p. 48 

Upper limit of the mean of a set of items, 
above which the set is defective 

Oy 

p. 47

equation (91)

Tolerance limit on U for a confidence coeffi¬ 
cient y. Minimum estimate of U sufficient 
to give a confidence y that the fraction 
defective will not exceed a fixed py 

u 

p. 72 

Johnson and Welch notation for sample 
statistic (L — £)/s , that is, the observed 
difference between L and the sample mean 
in units of the sample standard deviation: 
corresponds with our k 9 

Mo 

p. 72 

Johnson and Welch notation for critical 
value of statistic u ; corresponds with our k 

W 

p. 73 

Johnson and Welch notation for quantity 
in denominator of noncentral t denoting a 
variable distributed, independently of z, 
as x 2 // w ith / degrees of freedom; cor¬ 
responds with our (s/<r) a 

X 

p. 34 

Sx, the sum of the observations 

k 

p. 67 

equation (183) 

Value of X at which minimum value 
f* of Y r occurs 

x% or x 

p. 12 

A sample observation. A measurement on 

(i - 1, * • ‘ , N) 


a variable quality characteristic 

£ 

p. 12 

equation (4) 

Sample mean of a variable x 


p. 43 

Sample mean of a variable x for the ith 

(i - 1 ,•••,*) 


sample of a group of h samples 


p. 56 

equation (113) 

Abscissa of the point having the same 
ordinate as the point ($,s) on the bound¬ 
ary of the acceptance region for a two- 
sided criterion 

X* 

p. 73 

A variable having the chi-square distribu¬ 
tion 

r 

p. 34 

2z*, the sum of the squares of the observa¬ 
tions 

Ye 

p. 36 

equation (70) 

Boundary between error and acceptance 
regions when values of X are plotted 
horizontally and values of Y vertically; 
minimum value of Y that can be obtained 
for any given value of X 

Yn 

p. 36 

Boundary between rejection and accept¬ 
ance regions when values of X are plotted 
horizontally and values of Y vertically; 
parabola defined by equation (55), using 
the equality sign 

Y' 

p. 37 

equation (71) 

Sum of squares of adjusted observations, 
that is, observations from which a 
constant has been subtracted 

y' 

1 K 

p. 37 

equation (74) 

Boundary between the error and accept¬ 
ance regions when values of X are plotted 
horizontally and values of Y' vertically 

Y'h 

p. 37 

equation (73) 

Boundary between the rejection and 
acceptance regions when values of X are 
plotted horizontally and values of F' 
vertically 

Vn 

p. 67 

equation (184) 

Minimum value of the curve F«; a value of 
Y below which the lot is acceptable with¬ 
out regard to X 

r R 

p. 69 

equation (195) 

Minimum value of the curve Y f R ; a value of 
Y* below which the lot is acceptable 
without regard to X 

V 

p. 74 

equation (225) 

Johnson and Welch notation for a function 
of t Q and /, one of the arguments against 
which X is tabled 

y' 

p. 74 

equation (226) 

Johnson and Welch notation for a function 
of t 0 and /, one of the arguments against 
which X is tabled 

z 

p. 61 

Population mean of the normally distri¬ 
buted variable z 

Zx 

p. 61 

Value for Z for which it is required that the 
probability of rejection be a 

z 2 

p. 61 

Value of Z for which it is required that the 
probability of acceptance be 0 

* 

p. 61 

A normally distributed variable 

z 

p. 73 

Johnson and Welch notation for a variable 
distributed normally about zero with unit 
standard deviation 


$z_A$

p. 85

$\bar{x} + k_A s$

$z_R$

p. 85

$\bar{x} + k_R s$
CHAPTER 2


TOLERANCE LIMITS FOR NORMAL DISTRIBUTIONS 

1. NATURE OF TOLERANCE LIMITS 

1.1. Definition of Tolerance Limits 

A common method of specifying the quality of manufactured product 
is to set a range, the bounds of which are called “tolerance limits,” such 
that a certain percentage of the product made under commercial condi¬ 
tions may be expected to have a quality falling between the limits. For 
example, tolerance limits may be placed on dimensions to indicate the
extent of interchangeability of parts. Tolerance limits are also used 
in other fields, for example medicine and psychology, to indicate the 
range of normal variation in various characteristics. In general, toler¬ 
ance limits are limits within which lie a certain percentage of the indi¬ 
viduals of a statistical universe. 

The satisfactory application of tolerance limits requires that assign¬ 
able causes of variability be detected and eliminated so that the remaining 
variability may be considered random. In other words, prediction from 
past data of the proportion of future observations that will be contained 
within fixed limits assumes that the successive values of the quality 
characteristic in question may be considered randomly chosen from a 
statistical universe.* 

1.2. Class of Tolerance Limits Considered 

The tolerance limits considered in this chapter are symmetrical about
the mean, and they are based on the assumption that the underlying
statistical universe may be described by a normal distribution. When
the true mean and standard deviation $\mu$ and $\sigma$ of this normal distribution
are known, tolerance limits are formed by adding to and subtracting from
the mean a certain multiple $K$ of the standard deviation; the appropriate
multiple is easily obtained from tables of the normal distribution and
depends only upon what proportion of the universe is to be included
within the limits.† Values of $K$ corresponding to various proportions
included within the tolerance range may be found by entering Table 1.1
of Chap. 1 with $\epsilon$ equal to half the proportion excluded. For example, 90
percent of the population is included in the interval $\mu - 1.645\sigma$ to
$\mu + 1.645\sigma$, since 90 percent inclusion corresponds with 10 percent
exclusion, or $\epsilon = 0.05$.

* A detailed discussion of the nature, use, and practical basis for tolerance limits
may be found in Shewhart, Statistical Method, Chap. II, pp. 50-79. In fact, this is
the original source on which subsequent work on tolerance limits has been based.
(For full citations, see Sec. 4.)

† See Chap. 1, Sec. 2, for further discussion of this point.
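
When $\mu$ and $\sigma$ are known, the lookup described above amounts to inverting the normal distribution at $1 - \epsilon$. A minimal sketch follows; the function name is hypothetical, and `statistics.NormalDist` from the Python standard library stands in for Table 1.1.

```python
from statistics import NormalDist


def tolerance_factor_known(proportion):
    """Multiple K such that mu +/- K*sigma includes `proportion` of a normal universe."""
    eps = (1.0 - proportion) / 2.0           # half the proportion excluded
    return NormalDist().inv_cdf(1.0 - eps)   # normal deviate exceeded with probability eps


K = tolerance_factor_known(0.90)
print(round(K, 3))  # 1.645
```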

In most practical situations the true values $\mu$ and $\sigma$ are not known,
and tolerance limits must be estimated from the mean $\bar{x}$ and the standard
deviation $s$ of a sample of $N$ observations. While the limits $\mu \pm 1.645\sigma$
will include 90 percent of the universe, clearly the same assertion cannot
be made of the limits $\bar{x} \pm 1.645s$. The proportion included between the
limits $\bar{x} \pm Ks$ depends, for example, on whether the sample mean is
unusually large, reasonably close to the population mean, or unusually
small. The sample standard deviation is also subject to chance
fluctuations that affect the proportion included between the limits. In fact, if a
value of $K$ is fixed, no two samples from a given universe will lead to
the same tolerance limits, since both $\bar{x}$ and $s$ will vary from sample to
sample.

It is clear then that the proportion of the universe included between 
the limits X ± Ks is a random variable, and that it is impossible to 
determine K so that the limits will always contain a specified proportion 
P of the universe. It is possible, however, to determine K in such a way 
that in a large series of samples from normal universes a certain proportion 
γ of the intervals X ± Ks will include P or more of the universe; 
γ is called “the confidence coefficient,” since it is a measure of the confidence 
with which we may assert that a given tolerance range includes at 
least P of the universe. 
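
The meaning of the confidence coefficient can be illustrated by a small simulation. The sketch below is not part of the original computations: it assumes a standard normal universe, a sample size of 10, a fixed random seed, and two arbitrary trial values of K, and it simply counts how often the interval X ± Ks includes at least P of the universe. All function names are illustrative.

```python
import random
from statistics import NormalDist, mean, stdev

def coverage(sample, k):
    """Proportion of a standard normal universe lying within X +/- K*s."""
    x_bar, s = mean(sample), stdev(sample)
    z = NormalDist()                      # universe assumed N(0, 1)
    return z.cdf(x_bar + k * s) - z.cdf(x_bar - k * s)

def fraction_meeting(p, k, n, trials=2000, seed=1):
    """Share of simulated samples whose interval includes at least P."""
    rng = random.Random(seed)
    hits = 0
    for _ in range(trials):
        sample = [rng.gauss(0.0, 1.0) for _ in range(n)]
        if coverage(sample, k) >= p:
            hits += 1
    return hits / trials

# K = 1.645 is the known-parameter multiple for P = 0.90; K = 3.0 is arbitrary.
low = fraction_meeting(p=0.90, k=1.645, n=10)
high = fraction_meeting(p=0.90, k=3.0, n=10)
print(low, high)
```

With the known-parameter multiple, well under half of the simulated intervals reach 90 percent inclusion, while the larger multiple succeeds in most samples; the observed share of successes is an estimate of the confidence coefficient attached to that K.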

Intuitively, it seems reasonable that the values of K for use with 
sample means and standard deviations will be somewhat larger than the 
values used if the true mean and standard deviation are known. In 
fact, by taking K very large the probability that the interval X ± Ks 
will contain at least P of the universe may be made as close to 1 as is 
desired. Since the length of the tolerance interval is 2 Ks, the smaller K 
is, the more meaningful and the less trivial a tolerance range is. The 
position is, therefore, one of being able to state broad generalities with 
little risk of error, or to make more precise statements at greater risk. 
The problem is to find the smallest factor K that may be associated with a 
specified confidence coefficient γ, proportion P, and sample size N. 
Table 2.1 presents such factors for various combinations of γ, P, and N. 
Before discussing the use of Table 2.1, it is desirable to contrast the tolerance 
limits it yields with other kinds of limits. 

1.3. Other Kinds of Limits 

1.3.1. One-sided Tolerance Limits.—Instead of specifying a two-sided 
interval that will contain a certain percentage of the product, it 
will frequently be appropriate to specify a one-sided limit such that a 
fixed percentage of the product will have a quality greater than (or a 
quality less than, as the case may be) this limit. Problems of this sort 
may be solved by the methods of Chap. 1, Sec. 9.2. 

1.3.2. Multivariate Tolerance Limits. —Certain methods of setting 
tolerance limits have been generalized by Wald and by Paulson to the 
problem of setting simultaneous limits on several quality characteristics 
of a single item. In this connection, see also Chap. 3. 

1.3.3. Tolerance Limits for Non-normal Distributions. —Wald has 
considered the problem of setting tolerance limits on the basis of large 
samples for distributions of any known form, normal or otherwise. 
Application of these tolerance limits to normal distributions gives unsatisfactory 
results; their applicability (as contrasted with their theoretical 
correctness when N approaches infinity) to other forms of distribution 
has not been determined. 

1.3.4. Distribution-free Tolerance Limits. —Sometimes it is desirable 
to set tolerance limits without making any assumption about the form, 
normal or otherwise, of the underlying distribution. Problems of this 
sort have been discussed by Wilks and by Wald. The problem of deciding 
whether to use tolerance limits based on the normal distribution, those 
based on some other distribution, or those which involve no assumption 
about the form of the distribution is one which, though of a kind common 
in applied statistics, has no satisfactory solution. Tolerance limits based 
on the normal distribution are substantially shorter for a fixed sample 
size than those based on no assumption about the distribution; but they 
may be irrelevant if the underlying distribution is too far from normal. 
The decision about the adequacy of the normality assumption may be 
based upon an analysis of measurements of a large number of items; it 
may be supported by theoretical considerations, for example, when the 
quality characteristic is known to be determined by many chance causes 
whose effects are essentially additive; or it may rest on a combination of 
theoretical and empirical evidence. It should be noted that if there is a 
stable universe from which the measurements are random observations, 
limits of the form X ± Ks will, in a fraction γ of samples, include a 
certain proportion P′ or more of the universe, where P′ differs from the 
intended P by an amount depending upon the departure of the true 
distribution from normality.* 

1.3.5. Confidence Limits. —Tolerance limits should not be confused 
with confidence limits (or with fiducial limits) for the mean or other parameters 
of the distribution. Confidence limits for the mean, which also 
are of the form X ± Ks, are computed in such a way that in a fraction 
γ of the samples from which they are computed they will include the mean 

* For a fuller and more precise statement of this point, see Chap. 1, Sec. 9.4, 
including the quotation from Jennett and Welch. 





of the distribution. Tolerance limits, in contrast, are computed in such a 
way that in a fraction γ of the samples from which they are computed they 
will include at least a fraction P of the items in the distribution. Thus, 
as the sample size increases, the value of K for a confidence interval 
diminishes toward zero, whereas the value of K for a tolerance interval 
diminishes toward the normal deviate including a proportion P of the 
universe. 


2. TABLE OF FACTORS FOR TOLERANCE LIMITS 

2.1. Description 

Table 2.1 shows 3,880 values of the multiple K of the sample standard 
deviation s to be added to and subtracted from the sample mean X, for 
various values of the confidence coefficient γ, the proportion P of the 
distribution to be included within the tolerance limits, and the sample 
size N. Four values of γ are given: 0.75, 0.90, 0.95, and 0.99; five values 
of P: 0.75, 0.90, 0.95, 0.99, and 0.999; and 194 values of N: from 2 to 
102 by steps of 1, from 102 to 180 by steps of 2, from 180 to 300 by steps 
of 5, from 300 to 400 by steps of 10, from 400 to 750 by steps of 25, from 
750 to 1,000 by steps of 50, and infinity. 
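
The count of tabulated entries can be checked by enumerating the sample sizes as described. The sketch below is only a bookkeeping aid; the function name `table_sample_sizes` is an assumption.

```python
def table_sample_sizes():
    """Finite sample sizes tabulated in Table 2.1, as described in the text."""
    segments = [(2, 102, 1), (102, 180, 2), (180, 300, 5),
                (300, 400, 10), (400, 750, 25), (750, 1000, 50)]
    sizes = set()
    for start, stop, step in segments:
        sizes.update(range(start, stop + 1, step))   # segment endpoints are shared
    return sorted(sizes)

n_values = len(table_sample_sizes()) + 1   # plus the entry for infinity
entries = 4 * 5 * n_values                 # gamma values x P values x N values
print(n_values, entries)                   # 194 3880
```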

2.2. Use of the Table: Example 

In connection with fire-control problems for a certain projectile it is 
desirable to know, with a confidence coefficient of 0.99, limits within which 
the muzzle velocities of 90 percent of the future rounds will lie. The 
required calculations are performed on the observed velocities as follows: 
First, add all the observations, calling the sum ΣX. Then add all the 
squares of the observations, calling the sum ΣX². These two sums and N, 
the number of observations, are the only information needed from the 
sample; in the present example they are 

\[ \sum X = 291{,}200 \text{ ft. per sec.} \tag{1} \]

\[ \sum X^2 = 393{,}114{,}400 \tag{2} \]

\[ N = 216 \tag{3} \]

The mean and standard deviation are computed as follows: 

\[ \bar{X} = \frac{\sum X}{N} = 1{,}348.15 \text{ ft. per sec.} \tag{4} \]

\[ s = \sqrt{\frac{N \sum X^2 - \left(\sum X\right)^2}{N(N-1)}} = 49.82 \text{ ft. per sec.} \tag{5} \]


Entering Table 2.1 with P = 0.90 and γ = 0.99, we find for N = 215, 
K = 1.856, and for N = 220, K = 1.854; by linear interpolation we find 
that for N = 216, K = 1.856.* The required tolerance interval, in feet 

* Linear interpolation with respect to N will not result in an error of more than 
1 in the third decimal place. 

per second, is then 1,348.15 ± 1.856 × 49.82; so we may assert with 99 
percent confidence that 90 percent of all the future observations will lie 
between 1,255.7 and 1,440.6 ft. per sec. 
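
The arithmetic of this example can be retraced as follows. The sketch uses only the two sums, N, and the two tabled values of K quoted above; the variable names are illustrative, and the usual N − 1 divisor is assumed for the standard deviation.

```python
from math import sqrt

sum_x = 291_200.0          # sum of observed velocities, ft. per sec.
sum_x2 = 393_114_400.0     # sum of squared velocities
n = 216                    # number of observations

mean = sum_x / n
s = sqrt((n * sum_x2 - sum_x ** 2) / (n * (n - 1)))

# Linear interpolation in N between the tabled values quoted in the text.
k_215, k_220 = 1.856, 1.854
k = k_215 + (n - 215) / (220 - 215) * (k_220 - k_215)

lower, upper = mean - k * s, mean + k * s
print(round(mean, 2), round(s, 2), round(k, 3))   # 1348.15 49.82 1.856
print(round(lower, 1), round(upper, 1))           # 1255.7 1440.6
```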

2.3. Cases outside the Range of the Table 

2.3.1. Formula for Large Values of N.—For values of N beyond the 
range of the table, the following formula may be used:* 

\[ K = K_\infty \left( 1 + \frac{x_\gamma}{\sqrt{2N}} + \frac{5x_\gamma^2 + 10}{12N} \right) \tag{6} \]

where K∞ (the value of K we would use if the true μ and σ were known) is 
defined by 

\[ \frac{1}{\sqrt{2\pi}} \int_{-K_\infty}^{K_\infty} e^{-t^2/2}\,dt = P \tag{7} \]

and x_γ is defined by 

\[ \frac{1}{\sqrt{2\pi}} \int_{x_\gamma}^{\infty} e^{-t^2/2}\,dt = 1 - \gamma \tag{8} \]

Values of K∞ are listed in Table 2.1 under the appropriate P columns and 
are the same for all γ; they may also be determined easily from Table 1.1 
in Chap. 1. Values of x_γ may be found from any table of the areas of 
the normal curve, such as Table 1.1. For the values of γ used in Table 
2.1, however, Table 2.2 gives the values of the functions of x_γ required 
within the parentheses of formula (6). 

Unless N is too small, formula (6) also gives results of satisfactory 
accuracy for values of P and γ not included in Table 2.1. When N 
> 160, γ < 0.99, and P < 0.999, formula (6) gives values of K agreeing 
within 1 unit in the third significant figure with those in Table 2.1. 
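
For values of the confidence coefficient not covered by Table 2.2, the two coefficients inside the parentheses of formula (6) can be computed directly. The sketch below assumes that the deviate in formula (6) is the normal deviate exceeded with probability one minus the confidence coefficient, which is consistent with the entries of Table 2.2; the function name is illustrative.

```python
from math import sqrt
from statistics import NormalDist

def large_n_coefficients(gamma: float):
    """Coefficients of 1/sqrt(N) and 1/N inside the parentheses of formula (6)."""
    x = NormalDist().inv_cdf(gamma)   # deviate exceeded with probability 1 - gamma
    return x / sqrt(2.0), (5.0 * x * x + 10.0) / 12.0

for gamma in (0.75, 0.90, 0.95, 0.99):
    a, b = large_n_coefficients(gamma)
    print(gamma, round(a, 4), round(b, 4))
```

The four printed pairs agree with the entries of Table 2.2 to four decimal places.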

2.3.2. Example. —In studying another characteristic of the projectile 
referred to in Sec. 2.2, the following sample values were obtained: 


\[ N = 1{,}449 \tag{9} \]

\[ \bar{X} = 670.40 \tag{10} \]

\[ s = 37.45 \tag{11} \]

Again assuming γ = 0.99 and P = 0.90, we have, from Table 2.2 
and formula (6), 

\[ K = 1.645 \left( 1 + \frac{1.6450}{\sqrt{1449}} + \frac{3.0883}{1449} \right) = 1.720 \tag{12} \]


The upper and lower tolerance limits are, therefore, 734.81 and 605.99, 
respectively. 
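
This example can be retraced with the rounded constants used in the text. The sketch below takes 1.645 as the known-parameter multiple for P = 0.90 and the two Table 2.2 coefficients for a confidence coefficient of 0.99; the constant and function names are illustrative.

```python
from math import sqrt

K_INF = 1.645                  # K for P = 0.90 with known mean and standard deviation
COEF_SQRT_N = 1.6450           # Table 2.2, confidence 0.99, term in 1/sqrt(N)
COEF_N = 3.0883                # Table 2.2, confidence 0.99, term in 1/N

def k_large_sample(n):
    """Formula (6) with the Table 2.2 coefficients substituted."""
    return K_INF * (1.0 + COEF_SQRT_N / sqrt(n) + COEF_N / n)

n, x_bar, s = 1449, 670.40, 37.45
k = round(k_large_sample(n), 3)      # rounded to three decimals, as in (12)
upper, lower = x_bar + k * s, x_bar - k * s
print(k, round(upper, 2), round(lower, 2))
```

Rounding K to three decimals before forming the limits reproduces the limits 734.81 and 605.99 quoted above.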

* This formula was developed in the Statistical Research Group by Albert H. 
Bowker; see his paper cited in Sec. 4. 

TABLE 2.2 

Coefficients to Facilitate the Use of Formula (6) for Computing K When N Is Large 

Coefficient                         γ = 0.75   γ = 0.90   γ = 0.95   γ = 0.99 

Term in 1/√N:  x_γ/√2                 0.4769     0.9062     1.1631     1.6450 
Term in 1/N:   (5x_γ² + 10)/12        1.0229     1.5177     1.9606     3.0883 


3. CONSTRUCTION OF THE TABLE 

3.1. Formula 

The formula for K used in Table 2.1 is* 

\[ K = r \sqrt{\frac{n}{\chi_\gamma^2}} \tag{13} \]

where N is the number of observations, P is the proportion of the distribution 
to be included within the tolerance limits, γ is the confidence 
coefficient, r is a function of 1/√N and P defined by 

\[ \frac{1}{\sqrt{2\pi}} \int_{\frac{1}{\sqrt{N}} - r}^{\frac{1}{\sqrt{N}} + r} e^{-t^2/2}\,dt = P \tag{14} \]

and χ²_γ is defined by 

\[ P(\chi^2 > \chi_\gamma^2) = \gamma \tag{15} \]

in terms of the chi-square distribution for n degrees of freedom, where 

\[ n = N - 1. \tag{16} \]

3.2. Basic Network of Points 

K was computed directly from formula (13) for all values of n (that 
is, of N — 1) from 1 to 30, for all even values of n from 30 to 60, for 
values of n from 60 to 200 at intervals of 10, and for values of n from 199 
to 999 at intervals of 50. For other values of n, values of K were interpolated 
into this basic network of points by means of 4-point Lagrangian 
interpolatory coefficients taken from the table published by the New 
York Mathematical Tables Project. 

* Formula (13) was developed in the Statistical Research Group by A. Wald and 
J. Wolfowitz; see their paper cited in Sec. 4. 

3.3. Percentage Points of χ² 

For all values of n from 1 to 30 and for values from 30 to 100 at 
intervals of 10, the percentage points of χ² were obtained from Thompson’s 
table. Since for even n the probability of exceeding any given value 
χ² is given by the partial sum of a Poisson series, the required values 
between n = 30 and n = 60 (except n = 40 and n = 50) were obtained 
by inverse interpolation in Molina’s table* using Salzer’s inverse interpolation 
coefficients. The percentage points of χ² for n > 100 were found 
from a Cornish-Fisher type of expansion applied to the χ² distribution.† 
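
The Poisson relation used for even n can be written out directly. The sketch below is a modern restatement, not the tabular procedure described above: it sums the first n/2 terms of a Poisson series whose mean is half the chi-square value. The function name is an assumption.

```python
from math import exp

def chi2_upper_tail_even(x: float, n: int) -> float:
    """P(chi-square with n d.f. exceeds x), for even n, as a Poisson partial sum."""
    if n <= 0 or n % 2:
        raise ValueError("n must be a positive even integer")
    m = x / 2.0                      # Poisson mean
    term, total = exp(-m), 0.0
    for k in range(n // 2):          # terms k = 0, 1, ..., n/2 - 1
        total += term
        term *= m / (k + 1)
    return total

print(round(chi2_upper_tail_even(9.488, 4), 3))   # 0.05
```

Finding a percentage point then amounts to solving for the value of x that gives a stated tail probability, which is the inverse interpolation referred to in the text.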

3.4. Computation of r 

To compute r, Newton’s method was applied to the function 

\[ \phi(r) = \frac{1}{\sqrt{2\pi}} \int_{\frac{1}{\sqrt{N}} - r}^{\frac{1}{\sqrt{N}} + r} e^{-t^2/2}\,dt - P = 0 \tag{17} \]

The NYMTP tables of the probability integral are conveniently 
arranged for this, since the ordinates of the normal curve are tabulated 
beside the integral, and forward interpolation in both the integral and 
the ordinate may be expressed simply in terms of the ordinate. 
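
A minimal sketch of this Newton iteration follows, with library calls standing in for the printed tables. It assumes the limits of integration are 1/√N − r and 1/√N + r, so that the derivative with respect to r is the sum of the two normal ordinates at those limits; the function name, starting value, and tolerance are illustrative choices.

```python
from math import sqrt
from statistics import NormalDist

def solve_r(n_obs: int, p: float, tol: float = 1e-12, max_iter: int = 50) -> float:
    """Solve formula (17) for r by Newton's method."""
    z = NormalDist()
    d = 1.0 / sqrt(n_obs)
    r = z.inv_cdf((1.0 + p) / 2.0)             # start from K for known parameters
    for _ in range(max_iter):
        phi = z.cdf(d + r) - z.cdf(d - r) - p  # value of the function in (17)
        slope = z.pdf(d + r) + z.pdf(d - r)    # derivative: sum of two ordinates
        step = phi / slope
        r -= step
        if abs(step) < tol:
            break
    return r

print(round(solve_r(216, 0.90), 4))
```

Because the derivative involves only ordinates of the normal curve, a table that lists the ordinate beside the integral supplies everything each step requires.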

4. REFERENCES 

Bowker, Albert H.: “Computation of Factors for Tolerance Limits on a Normal 
Distribution When the Sample Is Large,” Annals of Mathematical Statistics, 
Vol. 17 (1946), pp. 238-240. 

Goldberg, Henry, and Levine, Harriet: “Approximate Formulas for the Percentage 
Points and Normalization of t and χ²,” Annals of Mathematical Statistics, 
Vol. 17 (1946), pp. 216-225. 

Mathematical Tables Project, Work Projects Administration for the City of New 
York, Federal Works Agency: Tables of Probability Functions, Vol. 2, National 
Bureau of Standards, Washington, D. C., 1942, 344 + xxi pp. 

-: Tables of Lagrangian Interpolation Coefficients, Columbia University Press, 
New York, 1944, 392 + xxxvi pp. 

Molina, E. C.: Poisson's Exponential Binomial Limit, D. Van Nostrand Company, 
Inc., New York, 1942, 47 pp. 

Paulson, Edward: “A Note on Tolerance Limits,” Annals of Mathematical Statistics, 
Vol. 14 (1943), pp. 90-93. 

Robbins, Herbert: “On Distribution-free Tolerance Limits in Random Sampling,” 
Annals of Mathematical Statistics, Vol. 15 (1944), pp. 214-216. 

Salzer, H. E.: “Table of Coefficients for Inverse Interpolation with Advancing 
Differences,” Journal of Mathematics and Physics, Vol. 23 (1944), pp. 75-102. 

Scheffé, H., and Tukey, J. W.: “A Formula for Sample Sizes for Population Tolerance 
Limits,” Annals of Mathematical Statistics, Vol. 15 (1944), p. 217. 

* In Molina’s notation 



† The application of the Cornish-Fisher method to the χ² distribution was carried 
out in the Statistical Research Group by Henry Goldberg and Harriet Levine; see 
their paper cited in Sec. 4. 





Shewhart, Walter A., with the editorial assistance of Deming, W. Edwards: 
Statistical Method from the Viewpoint of Quality Control, Graduate School, U.S. 
Department of Agriculture, Washington, D. C., 1939, 155 + ix pp. 

Thompson, Catherine M.: “Table of Percentage Points of the χ² Distribution,” 
Biometrika, Vol. 32 (1941), pp. 188-189. 

Wald, Abraham: “Setting of Tolerance Limits When the Sample Is Large,” Annals of 
Mathematical Statistics, Vol. 13 (1942), pp. 389-399. 

-: “An Extension of Wilks’ Method of Setting Tolerance Limits,” Annals of 
Mathematical Statistics, Vol. 14 (1943), pp. 45-55. 

-, and Wolfowitz, J.: “Tolerance Limits for a Normal Distribution,” Annals of 
Mathematical Statistics, Vol. 17 (1946), pp. 208-215. 

Wilks, S. S.: “Determination of Sample Sizes for Setting Tolerance Limits,” Annals of 
Mathematical Statistics, Vol. 12 (1941), pp. 91-96. 

-: “Statistical Prediction with Special Reference to the Problem of Tolerance 
Limits,” Annals of Mathematical Statistics, Vol. 13 (1942), pp. 400-409. 



CHAPTER 3 


MULTIVARIATE QUALITY CONTROL, ILLUSTRATED 
BY THE AIR TESTING OF SAMPLE BOMBSIGHTS 

1. Introduction 
    1.1 Type of Problem 
    1.2 Problems of Air-testing Bombsights 
2. Proposed Sampling Plan 
    2.1 Selection of Sample Sights and Arrangement of Bombing 
    2.2 Control Charts 
        2.2.1 General Plan 
        2.2.2 Control Chart for Mean Range 
        2.2.3 Control Chart for Mean Deflection 
        2.2.4 Control Chart for Range Variance 
        2.2.5 Control Chart for Deflection Variance 
        2.2.6 Control Charts Combining Range with Deflection 
        2.2.7 Illustration of the Use of Control Charts 
    2.3 Action When Results Are Bad 
    2.4 Experimentation 
3. Formulas and Considerations Affecting Revisions of the Plan 
    3.1 General Considerations 
        3.1.1 Occasions for Revision 
        3.1.2 Numbers of Sights, Flights, and Bombs 
        3.1.3 Revision Based on New Experience 
        3.1.4 Assumptions Involved 
    3.2 Means 
        3.2.1 Control Limits for Means 
        3.2.2 Asymmetry of Control Limits and Bias in Base Period 
        3.2.3 Yardsticks for Differences of Means 
    3.3 Variances 
        3.3.1 Control Limits for Variances 
        3.3.2 Yardsticks for Variances 
    3.4 Comprehensive Measures 
        3.4.1 Control Limits for the T’s 
        3.4.2 Yardsticks for the T’s 
4. Background and Reasons for the Plan 
    4.1 Objectives of Sampling Inspection 
    4.2 Reasons for Random Selection and for Lot Size 20 
    4.3 Relation to Experimental Design 
    4.4 Efficient Formulas for Utilizing Results 
    4.5 Exact Probabilities 
    4.6 Reasons for the T Method of Combining Range and Deflection Errors 
5. Statistical Analysis of Certain Test Bombings 
    5.1 Summary of the Data 
    5.2 Analysis of the Data 
    5.3 Ratings of Equipment and Personnel by Test Bombing 
6. References 



FIGURES 

3.1 Control Chart for Mean Range r̄ 
3.2 Control Chart for Mean Deflection d̄ 
3.3 Control Chart for Range Variance s²_r 
3.4 Control Chart for Deflection Variance s²_d 
3.5 Control Chart for Error in Mean Point of Impact T_M 
3.6 Control Chart for Dispersion about Mean Point of Impact T_D 
3.7 Control Chart for Over-all Error T_O 
3.8 Over-all Measure of Error T²_O Applied to Sample Data 
3.9 Errors in MPI in Sample, as Measured by T²_M 
3.10 Dispersion Measure T²_D Applied to Sample Data 

TABLES 

3.1 Summary of Bombing Errors in 65 Flights (260 bombs) 
3.2 Distribution of Flights by Personnel and Equipment 
3.3 Effect of Equipment on Over-all Accuracy of Bombing 
3.4 Variation of Over-all Accuracy with Personnel 


Harold Hotelling, University of North Carolina 


















1. INTRODUCTION 

1.1. Type of Problem 

Systematic plans of sampling for acceptance inspection or quality 
control are for the most part based on simple features of the individual 
articles inspected. Usually each article is classified merely as defective 
or nondefective. In a minority of cases there is a finer classification 
involving measurement of continuous variates. In the inspection of 
simple articles, including component parts of complicated assemblies, 
inspection by attributes is considered appropriate; but in the inspection 
of complicated assemblies, especially when inspection is costly, measurements 
are apt to be advantageous.* Furthermore, the performance of 
complicated assemblies is usually a function of several different variables. 
When several quality characteristics are measured, there arises a problem 
of combining the several measurements on each article into a single 
measure (“score” or “index”) of the over-all performance of the article.† 
In the absence of a formally defined procedure, methods of a more or 
less subjective character for combining the different measurements to 
form a judgment of degree of acceptability are bound to develop. The 
simplest such method is to apply customary practices to each variate 
separately and to reject whenever the usual rules applied to any one 
variate alone lead to a verdict of rejection. This procedure, however, 
has the effect of changing drastically the probabilities, both of accepting 
what is bad and of rejecting what is good. For example, if a sampling 
scheme is designed to give a probability of 0.05 of rejecting a lot of a 
certain quality on the basis of measurements of one variate, and if this 
scheme is applied to each of two variates measured on the same articles, 
then if the two variates are independently distributed, rejection on the 
basis of one or the other variate will take place in 9.75 per cent of sub¬ 
mitted lots of the given quality instead of 5 per cent as originally con¬ 
templated. The discrepancy between the actual and the intended 
probabilities of rejection will increase rapidly with the number of inde¬ 
pendent variates when such a rule is followed. 
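
The figure of 9.75 per cent, and the growth of the discrepancy with the number of variates, follow from a one-line calculation. The sketch below assumes, as the text does, that the variates are independent and that each alone leads to rejection with probability 0.05; the function name is illustrative.

```python
def reject_if_any(alpha: float, k: int) -> float:
    """Chance of rejection when any one of k independent variates can reject."""
    return 1.0 - (1.0 - alpha) ** k

for k in range(1, 6):
    print(k, round(reject_if_any(0.05, k), 4))
```

For two variates the result is 0.0975, the figure quoted above.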

* For a further discussion of this point, see Chap. 1, Sec. 1. 
† For a further discussion of this point, see Chap. 1, Sec. 10.1. 

An alternative rule, erring in the opposite direction, would be to 
accept any lot for which the univariate scheme applied to any one of 
the variates calls for acceptance. Thus rejection would require that all 
the variates take such values as to call for rejection. For two inde¬ 
pendent variates, a probability of rejection intended to be 0.90 would 
actually in such a case be only 0.81. 

These probabilities can be adjusted so as to become equal to 0.05, 
or such level as is chosen, by altering the acceptance level for each variate 
separately. However, this brings in additional difficulties. The variates 
may not be mutually independent, and then calculations such as the fore¬ 
going must be altered to take into account the multivariate distribution. 
Furthermore, it will often not be known whether they are independent or 
not; or if they are mutually dependent, the character of the dependence 
may be known only imperfectly. Thus the correlation coefficient may 
have to be estimated from a preliminary sample of size so small as to 
leave its exact value somewhat uncertain. Any acceptance probabilities 
based on such a correlation coefficient will then likewise be uncertain. 
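
Under the assumption of independence, which the paragraph above warns may not hold, the adjusted level for each variate can be found by inverting the two combined rules. The sketch below is illustrative only; both function names are assumptions.

```python
def per_variate_level_any(target: float, k: int) -> float:
    """Per-variate rejection probability so that 'reject if any' rejects with chance target."""
    return 1.0 - (1.0 - target) ** (1.0 / k)

def per_variate_level_all(target: float, k: int) -> float:
    """Per-variate rejection probability so that 'reject only if all' rejects with chance target."""
    return target ** (1.0 / k)

print(round(per_variate_level_any(0.05, 2), 4))   # 0.0253
print(round(per_variate_level_all(0.90, 2), 4))   # 0.9487
```

When the variates are correlated, these adjusted levels no longer give the intended probabilities, which is the difficulty the text describes.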

Rules such as those suggested above have another defect. An article 
close to the margin of acceptability with respect to one variate may well 
be marked for acceptance or rejection on the basis of the other variates 
involved. Unusual excellence in one respect may often atone for a slight 
departure in another way from what would otherwise be considered 
satisfactory. A formula is then needed for combining the measures in 
the several dimensions into a single measure of excellence, or of adherence 
to specifications or to a desired ideally uniform set of dimensions. Such 
a formula should give a measure of departure from the ideal that will 
increase with a departure in any dimension. It should lend itself to an 
exact interpretation in terms of probability, taking account of the neces¬ 
sarily limited size of any preliminary sample used to determine the 
parameters entering into the formula. 

Formulas of this type are here developed on the basis of an expression 
known as the generalized Student ratio T, introduced in 1931 for the 
testing of hypotheses about the location of the means of multivariate 
distributions and closely related to the exponent in the multivariate 
normal distribution.* If errors in two or more dimensions were mutually 
uncorrelated and of equal variance in the sample representing past 
experience, the value of T for a new individual would be directly pro¬ 
portional to the distance from the point whose rectangular coordinates 
represent the ideal to the point representing the actual measurements. 
Thus, for a bomb dropped on a target, T would in such a case be simply 

* See Hotelling, “The Generalization of Student’s Ratio.” (For full citations, 
see Sec. 6.) 

the distance of the bomb from the center of the target, expressed in terms 
of a unit equal to the standard deviation found in the earlier experience. 
Actually, the variances of range errors in bombing are appreciably less 
than those of deflection errors.* Consequently, the expression used for 
T² has different coefficients for the two squared terms. In addition, it 
has a term in the product of range and deflection errors, the small coefficient 
of which results from a slight correlation between the two kinds of 
error in the available experience. This T² serves as a measure of the 
deviation of the particular bomb from the center of the target, a measure 
more accurately interpretable in terms of probability than is the actual 
distance. By adding the values of T² for all the bombs dropped with a 
particular bombsight, we have a measure useful in judging the accuracy of 
that bombsight. There are also other ways of combining these T’s to 
throw light on particular features. 
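
The structure of such a T² can be sketched as a quadratic form in the range and deflection errors, with coefficients taken from the inverse of the variance and covariance matrix of past experience. The sketch below is an illustration under stated assumptions, not the chapter's working formula: the variances, covariance, and bomb errors are hypothetical numbers, the function name is invented, and any factor depending on the size of the base sample is omitted.

```python
from math import sqrt

def t_squared(range_err, defl_err, var_r, var_d, cov_rd):
    """Quadratic form e' S^{-1} e for a 2-D error e and covariance matrix S."""
    det = var_r * var_d - cov_rd ** 2
    if det <= 0:
        raise ValueError("covariance matrix must be positive definite")
    # Coefficients of the two squared terms and of the product term.
    a, b, c = var_d / det, var_r / det, -2.0 * cov_rd / det
    return a * range_err ** 2 + b * defl_err ** 2 + c * range_err * defl_err

# Hypothetical past experience: equal variances (s = 20), no correlation.
t = sqrt(t_squared(30.0, 40.0, var_r=400.0, var_d=400.0, cov_rd=0.0))
print(t)   # 2.5, the distance 50 divided by s = 20

# Hypothetical errors for four bombs from one sight, with unequal variances.
bombs = [(10.0, -25.0), (-5.0, 30.0), (20.0, 5.0), (-15.0, -10.0)]
total = sum(t_squared(r, d, var_r=300.0, var_d=500.0, cov_rd=40.0) for r, d in bombs)
print(round(total, 3))
```

In the first case T reduces to the distance from the target in units of the standard deviation, as stated above; in the second the unequal variances and the small covariance give the two squared terms different coefficients and introduce a product term.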

A great advantage of all these T statistics is that, in samples from 
normal distributions, they have probability distributions that can be 
determined exactly. Many statistics have distributions involving what 
may be called “nuisance parameters,” that is, parameters which are not 
known exactly from general considerations or basic scientific constants 
but have to be estimated, with more or less uncertainty, from fallible 
observations. This is true, for example, of the actual distance of a 
bomb from a target, whose distribution depends not only on the mean 
value of such distances but also, and in a complicated way, on the direction 
of the deviation. The parameters specifying the distribution of the 
direction—which is really irrelevant—are nuisance parameters in the 
distribution of the actual distance, confusing its interpretation as a 
measure of accuracy of the bomb. But the T statistics have been con¬ 
structed in such a way that there are no nuisance parameters involved in 
their distributions, at least as long as we are dealing with data from a 
fundamental distribution of the normal form. 

It is to be expected that these T statistics will have many applica¬ 
tions in sampling inspection and in other fields. In particular, wherever 
a manufactured article is to be judged, not on the basis of measurement of 
parts, but on the basis of final performance as measured by several vari¬ 
ables, there seems to be room for the utilization of such methods. A case 
of this kind, which is here considered in detail and for which the methods 
presented here were originated, is that of air testing sample bombsights. 
It is presented here as a precedent that will be useful in other situations 
in which final performance is determined by multiple measurements. 

* An error in range is an error in the direction of the airplane’s heading at the 
time of release. An error in deflection is an error in a direction perpendicular to the 
plane’s heading. 

The mathematical theory is not given in full in this chapter, but the 
methods recommended will, it is believed, be fully intelligible on the basis 
of the material presented here.* 

Besides the T distribution, several other innovations are here sug¬ 
gested, both graphic devices and methods of applying to statistical 
quality control some of the ingenious devices and concepts that R. A. 
Fisher has introduced into the design of experiments. 

1.2. Problems of Air Testing Bombsights 

The dropping of bombs from airplanes for the purpose of testing 
bombsights submitted by manufacturers is a phase of quality control. 
The art and the literature of quality control, so greatly expanded of late, 
are to a certain extent applicable to this bombing procedure. Yet test 
bombing presents several novel features that make the situation very 
different from those heretofore dealt with in the literature and usual practice 
of quality control. These peculiar features, which make this no 
ordinary problem and render the usual methods inadequate and to some 
extent inappropriate, are the following: 

(1) In contrast to the simple articles to which routine mass sampling 
inspection is usually applied, with at most one or two numbers recorded 
for each article tested, and often merely a “good” or “bad” verdict, the 
bombsight is a highly complex instrument with a multiplicity of parts, 
each of which is subject to various possible defects. 

(2) The process of putting together these parts in selective assembly, 
matching an oversize part with an undersize part to fit in with it, grinding 
and adjusting when necessary, is a series of jobs requiring great skill, 
whose results are to be judged by the performance of the finished instrument 
rather than by measurement of parts. Performance, however, is a 
complicated thing. If quality is to be measured by performance and 
expressed by one or a few numbers, caution must be exercised in the 
choice of these numbers and their interpretation. 

(3) Air testing is only one of a series of tests and inspections to which 
a bombsight and its parts are subjected. It is the final step, and an 
exceptionally costly one. For elimination of defects the main reliance 
is properly placed on earlier and cheaper steps. A great deal of careful 
screening out of defectives has taken place prior to the final stage of air 
testing, with which alone this chapter is concerned. 

* It is planned to present the mathematical theory in the Annals of Mathematical 
Statistics. 

(4) The errors of test bombing are not by any means due entirely to 
defects in the sight. In spite of all attempts at standardization there are 
bound to be variations in the weather, the altitude from which the bombs 
are dropped, the bombs themselves, the airplane, the auxiliary equipment, 
and the personnel engaged in flying, bombing, and observing the 
results. The variance in the results of the bombing that results from 
causes having nothing to do with the quality of the bombsight indicates 
that, while valuable statistical inferences are possible from large numbers 
of cases, it would be a dubious procedure to take any drastic action on the 
basis of the four or six bombs usually dropped with a sight, except perhaps 
in the most extreme cases. 

(5) When bombing with a particular sight shows unsatisfactory 
results, the action ordinarily taken is not the rejection of the sight, but 
its adjustment or repair. 

(6) Whereas the usual sampling inspection for quality control is 
carried out very cheaply for each single inspection by a workman of 
moderate skill using some simple instrument, the dropping of a test bomb 
requires a pilot, a bombardier, an airplane with gasoline and maintenance, 
an airfield, a target area, stations for observing the point of impact and 
personnel to man them, and finally, plotting and computations. 

The function of air testing is clearly not the elimination of individual 
bad sights or of bad lots of 10 from which the sights tested are taken. 
This is done far more efficiently by inspection of parts and by shop testing 
of the finished bombsight, both by the manufacturer and by the govern¬ 
ment representatives at the factory, as well as by the shop testing of the 
sample sent to the government proving ground. The principal functions 
which air testing is capable of fulfilling appear rather to be the following: 

(1) Keeping a running record of quality designed to compare the 
work of different productive units, and of the same unit at different times. 

(2) Detection of irregularities, trends in quality, and deviations from 
customary levels in the average performance of sights produced over a 
period. 

(3) Exploration of possible types of defects, either in design or in 
production, not previously recognized. 

(4) By experiments carefully designed so as not to conflict with the 
other purposes of air testing, to obtain judgments on differing models 
of sights, on auxiliary equipment, on bombing methods, and on personnel 
in a manner more efficient than separate experiments carried out solely to 
obtain these judgments. 

The high cost and uncertainty of air testing, with the relative accuracy 
and low cost of shop testing, indicate that only a very small number of 
bombsights should be air-tested. However, it does appear that some air 
testing should go on, for reasons outside those usual in sampling inspec¬ 
tion and to an extent that cannot be determined by calculation according 
to customary methods. In the specific, though flexible, plan suggested 
in Sec. 2, the amount of routine bombing called for is one-twentieth 
of that under procedures in effect in 1943. The exact proportion suitable 
cannot be derived by calculation from the proportion of defects, which 
could not be ascertained reliably from the record because of the virtual 
absence of defects serious enough to cause rejection. The fraction sug¬ 
gested represents what is in the author’s view a reasonable proportion of 
bombsights to be tested if the improved statistical methods described 
herein are utilized. The plan can easily be modified, in accordance with 
Sec. 3, to fit any other proportion considered appropriate. 

In much quality-control work where individual inspection is cheap, 
inefficient statistical formulas are used frankly and deliberately when they 
are easy to calculate and to comprehend. The saving in computation 
represented by using the mean deviation instead of the standard devia¬ 
tion with many small batches may in some circumstances offset the 
increase of approximately one-eighth in the amount of inspection neces¬ 
sary to get the same accuracy. The range of a sample (that is, the differ¬ 
ence between the greatest and least of the values) is often taken as a 
measure of dispersion when it is felt that the inability of factory employes 
to understand any more sophisticated measure warrants the cost of the 
additional observations required to obtain the necessary accuracy. 

Considerations of this kind should have no place where the unit cost 
is as great as that of dropping a bomb. Information acquired at such 
great expense should not be wasted by the use of inefficient statistical 
methods of combining the observations. One object of this chapter is to 
set forth the most appropriate methods of combining the observations of 
the bomb coordinates and of making inferences from them. Some modi¬ 
fications of the usual kinds of control charts are introduced. Further¬ 
more, the contribution of modern statistical methods to the improvement 
of accuracy is not limited to formulas for combining the observations after 
they are made. Certain precautions, such as proper randomization and 
replication, can be arranged only before the observations are made. The 
application of these principles to air-testing sample bombsights by a plan 
in accordance with modern principles of experimental design is another 
leading purpose of this chapter. 

Section 2 consists of a specific plan for the selection of sample bombsights, 
their air testing, the calculations to be made from the results, a 
series of control charts of new types, and some simple considerations 
relevant to interpretation of the results and to action. 

Since Sec. 2 is easily adaptable to changed conditions and modifica¬ 
tions of the plan, formulas are given in Sec. 3 for calculating new control 
limits and other needed quantities under conditions more general than 
those of Sec. 2. Provision is here made for changing the number of bombs 
dropped per flight, the number of flights with each sight, the number of 
sights to be taken from a lot for air testing, and the base period with which 
comparison is to be made. 

While Secs. 2 and 3 consist principally of simple rules for operating 
and for amending the sampling plan, with little mathematics or argu¬ 
mentation, Sec. 4 gives the background and reasons for the plan. The 
chief mathematical discussion has, however, been confined to a technical 
paper, where the mathematical reader will find a more precise account 
of some of the new ideas presented here for the first time. These ideas 
are applicable to a wide range of problems, not only in quality control but 
also in connection with bombing and firing in general, with mental test¬ 
ing, and with other problems in which two or more normally distributed 
variates are involved. 

Section 5 is an analysis of the results of dropping 260 ore-filled bombs. 
Measures of dispersion obtained from these bombs are substituted into 
the formulas of Sec. 3 to obtain the control limits specified in the plan of 
Sec. 2, together with coefficients for combining range and deflection 
errors and other needed quantities. For any sampling plan, a base 
period is needed to get the plan started, and this is provided by the 
material analyzed in Sec. 5. This section also contains as by-products a 
specimen rating of equipment and personnel involved in that bombing, 
essentially to show how such ratings can be made, and a new simplified 
technique of analysis of variance with unequal class frequencies developed 
for the purpose of making these ratings. 

This chapter includes control charts having several novel features, 
which it is believed will also be found useful in other control-chart work. 
One is the choice of the functions combining different measures of 
accuracy, here range and deflection errors, to be plotted. These func¬ 
tions are invariant under transformations of coordinates and hence have 
probability distributions capable of exact determination. Another 
feature of the control charts here presented is the provision of yardsticks, 
for visual comparison of different plotted values, with a definite prob¬ 
ability interpretation. 

In fixing the control limits corresponding to specified probabilities, 
account is here taken of the limited number and consequent sampling 
fluctuations of the observations in the base period used for comparison. 
It is to be feared that in many control charts heretofore in use there has 
been an unconscious distortion of the probabilities because of neglect of 
this consideration, with a bias uniformly in the direction of exaggerating 
the significance of moderately large deviations from norms. Use of 
formulas such as those of Sec. 3 in industrial quality control should cor¬ 
rect this error. 

2. PROPOSED SAMPLING PLAN 
2.1. Selection of Sample Sights and Arrangement of Bombing 

(1) Whenever a new factory goes into production, each of the first 
40 bombsights produced and passing the usual shop tests of the government 
inspectors, both at the factory and at the proving ground, is to be 
tested. If all these are found satisfactory, the remainder of the plan is 
to be applied to subsequent production, subject to withdrawal with 
reversion to the status of a new factory if later a series of unacceptable 
sights is offered. (Since the last contingency has not hitherto occurred, 
accurate formulation of a definite rule for such reversion must await the 
accumulation of this undesirable kind of experience.) 

(2) At each factory that has been approved after the first 40 bombsights, 
the sights of each model are to be numbered consecutively in the 
order of manufacture in each of two series. One series, designated A, 
shall consist of those sights approved without unusual delay by the 
government inspector at the factory on first delivery by the manufacturer. 
The other series, designated B, shall consist of those sights first rejected 
by the government inspector and later approved after defects have been 
remedied, together with any other approved sights whose approval has 
for any reason been delayed. To each sight are to be affixed both the 
usual serial number representing original order of manufacture and the 
letter A or B, with the number within the A or B series. The subsequent 
procedures are to be applied separately to the A and B series for 
each factory and for each model. 

(3) Each series is to be divided into successive lots of 20, strictly in 
the order of original manufacture. 

(4) From each lot of 20, two sights are to be selected by an objective 
chance procedure and sent to the proving ground. No element of per¬ 
sonal judgment or of rating on inspection should enter into this selection, 
which may be made in the following manner: the procurement authorities 
will send to the chief inspector at the factory a series of opaque sealed 
envelopes marked on the outside with the first and last serial numbers of 
the successive batches. An envelope is to be opened only after the 
inspection at the factory of all the sights in a batch has been completed 
and report made. Within the envelope will be found two numbers 
between 1 and 20, inclusive, prepared on a strictly chance basis, with 
equal probabilities for all twenty numbers. This must be done through 
actual utilization of a chance mechanism or of a table of random num¬ 
bers,* without human discretion other than in the selection of the mech¬ 
anism. One obvious procedure would be to mark twenty playing cards 
with the numbers 1 to 20, and after thoroughly shuffling to draw out two 
of them at random. 

(5) The remaining 18 sights of each lot are not to be held at the 
factory until completion of the air testing of the sample but should be 
sent directly to service, subject to later recall or repair in the unlikely 
event that the sample is found seriously defective. 

* For example, Fisher and Yates, Statistical Tables, Table XXXIII; Kendall 
and Smith, Random Sampling Numbers; Tippett, Random Sampling Numbers. 

(6) Upon arrival at the proving ground, the sample sights are all to 
be shop-tested and adjusted as at present, with such further types of shop 
test as may be developed. 

(7) By an objective random procedure like that described in Para¬ 
graph (4) above, lots are to be selected for the air testing of the two sights 
representing the lot at the proving ground. It is suggested that one- 
fortieth of the lots produced by each manufacturer be selected by chance. 
(It is likely that this proportion should be reduced as the improved 
methods of shop testing prove their value.) 

(8) Each of the two sights representing each of the lots selected in 
accordance with Paragraph (7) is to be taken up in an airplane twice to 
be used for dropping bombs. 

(9) The routine flight testing is to be arranged in periods, each consisting 
of four consecutive days, or four days as nearly consecutive as 
possible even though there may be breaks on account of Sundays or 
special circumstances. The number of test flights to be made per day 
should first be determined and will here be denoted by k. This same 
number of flights k should be made on each day of the 4-day period, 
though it is not essential that k should be the same for different periods. 

(10) The sights to be tested in a 4-day period are to be the two sample 
sights from each of k lots. Each sight is to be tested on two different 
days of the period, and each lot is to be represented on all four days. 
Subject to these conditions, the assignment of the particular sight of a 
lot to be tested on a specific day is to be decided by objective chance, for 
example, by flipping a coin. The k lots represented may be from the 
same or different factories. They may be of different models and may 
be of A or B series or both. 

(11) Exactly four bombs are to be dropped on each flight. Range 
and deflection errors are to be measured with respect to the heading of the 
airplane at the time the particular bomb was dropped. 

(12) The numerical values for control limits and yardsticks given in 
this section are to be used only if the directions in the preceding eleven 
paragraphs are carried out exactly. If it is decided to revise the plan 
by dropping a different number of bombs on each flight, or by measuring 
range and deflection with respect to the true course instead of the heading, 
the numerical values will need to be amended in accordance with the 
formulas of Sec. 3. The number of bombs dropped per flight will be 
denoted by N, with the understanding in this section that N = 4. No 
change should be made except as part of an explicit revision of the plan. 

(13) All measures of quality based on the bombing results should be 
algebraic functions of the coordinates of the bombs with respect to the 
target, taken with their proper algebraic signs. Absolute values and 
mean deviations are not algebraic functions and should not be employed 
in this connection. Use of criteria based on mean deviations requires 
more bombing to procure comparable accuracy than the formulas used 
herein. Furthermore, in contrast to the T measures here recommended, 
no combination of mean deviations has an exactly known distribution. 
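
The chance procedures of Paragraphs (4), (7), and (10) can be sketched in a few lines. What follows is only an illustration under stated assumptions: the function names are hypothetical, a seeded pseudo-random generator stands in for the physical chance mechanism or random-number table that the plan calls for, the one-in-forty selection of lots is treated as an independent draw for each lot, and the coin flip of Paragraph (10) is replaced by a random choice of two days out of four.

```python
import random

LOT_SIZE = 20        # sights per lot, Paragraph (3)
SAMPLE_PER_LOT = 2   # sights drawn from each lot, Paragraph (4)
DAYS = 4             # days in one test period, Paragraph (9)


def draw_sample(rng):
    """Two distinct positions from 1 to 20, every pair equally likely."""
    return sorted(rng.sample(range(1, LOT_SIZE + 1), SAMPLE_PER_LOT))


def select_lots_for_air_test(lot_ids, rng, fraction=1 / 40):
    """ASSUMPTION: each lot is kept independently with chance one in forty."""
    return [lot for lot in lot_ids if rng.random() < fraction]


def schedule_period(lots, rng):
    """Assign the two sample sights of each lot to the four days.

    Each sight flies on two different days and each lot flies once per day,
    so a period with k lots has exactly k flights on every day.
    """
    schedule = {day: [] for day in range(1, DAYS + 1)}
    for lot in lots:
        earlier_days = set(rng.sample(range(1, DAYS + 1), 2))
        for day in range(1, DAYS + 1):
            sight = "earlier" if day in earlier_days else "later"
            schedule[day].append((lot, sight))
    return schedule


rng = random.Random(1943)  # fixed seed only so the sketch is repeatable
print(draw_sample(rng))
for day, flights in schedule_period(["lot-07", "lot-19", "lot-33"], rng).items():
    print(day, flights)
```

With three lots in the period, k = 3 and each day carries three flights, one per lot, in agreement with Paragraphs (9) and (10).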

2.2. Control Charts 

2.2.1. General Plan. —Seven control charts, illustrated by Figs. 3.1 to 
3.7,* are to be kept for each factory and model and for the A and B series. 
On these charts certain indicators of quality of successive lots are plotted, 
strictly in the order of production. Entries on these charts should be 
made daily as soon as the data become available. The charts should be 
posted conspicuously at the proving ground where they will be seen by all 
concerned with the testing. Copies of them should be sent weekly to 
the procurement authorities, to the particular manufacturer concerned, 
and to the government inspector at the factory. The charts should all 
be drawn with the same horizontal scale, representing the order of the lot 
represented, and should be placed one above another with a clear indica¬ 
tion of the boundaries between them when this is practicable. Space 
should be left at the bottom for comments on exceptional cases and a 
notation of action taken with respect to such cases. 

The quantities represented on the charts, from top to bottom in 
order, will be designated by $T_O$, $T_D$, $T_M$, $s_r$, $\bar{r}$, $s_d$, $\bar{d}$. The highest of these, 
$T_O$, is the most comprehensive measure of the bombing errors associated 
with the sights under consideration, and anyone who has time to consider 
only one measure should watch $T_O$. The next two break down the over-all 
error into a part representing deviations from the mean point of impact 
(MPI) of the bombs on a flight and a part representing deviations of the 
MPI from the target. All these first three measures are combinations 
of range errors with deflection errors. Of the remaining four quantities 
charted, two refer to range errors alone and two to deflection errors alone, 
two to the MPI and two to deviations from the MPI. 

Any general tendency to slipshodness and poor work in a factory is 
more likely to be observed in rising curves in the upper charts, approaching 
and passing the control limits, whereas light on specific faults is to be 
sought by casting the eye downward gradually to the lower charts. 

* These illustrative charts were made up from the actual bombing data discussed 
in Sec. 5 and are thus mutually consistent. Though these records of flights were all 
derived from different sights, they were arbitrarily grouped into consecutive pairs, 
representing fictitious single sights, and these pairs were again grouped into consecutive 
pairs representing samples from hypothetical lots, just as if the plan here presented 
had been in effect. The last flight was omitted so as to leave 64 flights, a 
multiple of 4, as is required by this plan. The result is to give a reasonably good 
illustration of the kind of charts that can be expected in test bombing of the kind 
carried out with bombsights of the same general quality as the ones then tested. 

The seven quantities to be charted will be defined in the following 
paragraphs, beginning with the simplest. They will therefore be in a 
different order from that above. The actual ordinates are not proportional 
to the quantities mentioned above, except $\bar{r}$ and $\bar{d}$, but to their 
logarithms. Consequently, arithlog (that is, semilogarithmic) paper 
should be used for the top five charts to save the trouble of looking up 
logarithms. Also, the vertical scales on these five top charts are to be 
marked on the left with numerical values equal to the squares of the T's 
and s's since, as will be seen later, this avoids the labor of extracting 
square roots. The T's and s's themselves are given on vertical scales at 
the right of these charts. 

We shall use $r$ and $d$ as symbols for range and deflection errors, 
respectively, in mils,* and indicate by $S$ the sum over the $N$ (= 4) bombs 
in a flight of the values of the expression following $S$. 

All the numerical coefficients, yardstick lengths, and control limits 
herein specified should be revised periodically as further experience 
accumulates; they will especially need to be revised after any change in 
technique has been in effect long enough to provide a sufficient basis in 
experience. Formulas for revision will be found in Sec. 3. 

2.2.2. Control Chart for Mean Range.—The control chart for $\bar{r}$, illustrated 
by Fig. 3.1, represents $\bar{r}_F = Sr/N$ for each flight by a cross or a 
circle. For each lot the values of $\bar{r}_F$ found from the two flights of the 
sight produced earlier than the other are represented by circles, while 
those for the sight produced later are represented by crosses. The mean 
of these four values of $\bar{r}_F$ is $\bar{r}_L$, the mean of the range errors for the lot. 
The points representing $\bar{r}_L$ for successive lots are connected by straight 
lines. 

Control limits are shown as two pairs of horizontal lines. The probability 
that a lot of sights constituting a random sample from the same 
population as the sights tested in the base period, with the same distribution 
of errors of all kinds as in the test period, should be represented by 
a point outside the outer control limits is 0.01. The probability that such 
a lot should yield a value of $\bar{r}_L$ falling outside the inner control limits 
under the same conditions is 0.05. The upper and lower limits corresponding 
to each probability are at equal distances, not from the zero 
line, but from a line representing the range error in the MPI of all the 
bombs dropped in the base period. This point is discussed in Sec. 3.2, 
where the formulas and numerical values for the control limits are also 
given. The 0.01 limits are represented by solid lines at 5.89 and -3.20 
mils; the 0.05 limits are shown by lines of dashes at 4.80 and -2.11 mils. 

* A mil is an angular measure approximately equal to 1/6,400 of 360 degrees or 
to 1/1,000 of a radian, the exact definition varying from one usage to another. 

Fig. 3.1.—Control chart for mean range $\bar{r}$. 

If the zigzag line should get outside the band marked out by the inner 
control limits, this fact may be taken as a warning to be alert for any 
further indication of faults tending to make bombs fall consistently too 
far short of or too far beyond the target, as the case may be. If the outer 
control limits should be surpassed, the warning is more emphatic, since 
this will happen only once for 100 lots if there are only the same sources of 
range error as in the base period. An instance of exceeding the outer 
control limits should be made the occasion of an investigation at the 
proving ground to determine the cause. A definite trend upward or 
downward in the plotted quantity is another indication of a dangerous 
condition, even before the inner control limits are reached. In such 
contingencies, unless the trend or the excessive value of $\bar{r}_L$ is definitely 
attributed to faults of the airplane, personnel, or other cause independent 
of the quality of the sight, the manufacturer should be warned promptly. 
It is particularly important to give early warning of trends in order that 
there may be an opportunity to correct gradually developing faults before 
they become serious. Action to be taken when control limits are passed 
is specified further in Sec. 2.3. 
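
As a minimal sketch of the bookkeeping behind this chart, the following computes the flight means and the lot mean and compares the lot mean with the limits quoted above. The function names and the sample errors are hypothetical; only the four limit values come from the text.

```python
# Control limits for the lot mean range error, in mils, as quoted in Sec. 2.2.2.
OUTER = (-3.20, 5.89)  # probability 0.01
INNER = (-2.11, 4.80)  # probability 0.05


def flight_mean(range_errors):
    """Mean range error of one flight: Sr / N."""
    return sum(range_errors) / len(range_errors)


def lot_mean(flights):
    """Mean of the flight means for the four flights representing a lot."""
    means = [flight_mean(f) for f in flights]
    return sum(means) / len(means)


def classify(value):
    """Place a lot mean relative to the inner and outer control limits."""
    if not OUTER[0] <= value <= OUTER[1]:
        return "outside 0.01 limits: investigate at the proving ground"
    if not INNER[0] <= value <= INNER[1]:
        return "outside 0.05 limits: be alert for further indications"
    return "within limits"


# Hypothetical range errors (mils) for the four flights of one lot.
flights = [
    [3.1, -4.0, 6.2, 1.5],
    [0.4, 2.2, -1.0, 5.6],
    [7.9, 3.3, 4.1, 6.0],
    [2.0, -0.5, 8.4, 3.7],
]
value = lot_mean(flights)
print(round(value, 2), classify(value))
```

Both pairs of limits have the same midpoint, about 1.35 mils, which is consistent with their being centered on the base-period MPI rather than on zero. A check of this kind does not replace the visual search for trends, which the chart is meant to support.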

The control limits refer to $\bar{r}_L$ but serve a secondary purpose also with 
respect to $\bar{r}_S$, the mean of the range errors for a sight, represented by a 
point that is ordinarily not marked but that can be estimated visually 
from a pair of crosses or of circles. The control limit having probability 
0.01 of being exceeded by $\bar{r}_L$ has a probability approximating 0.20 of being 
exceeded by $\bar{r}_S$, and that having probability 0.05 for $\bar{r}_L$ has probability 
practically 0.33 for $\bar{r}_S$, provided the number of bombs dropped in the base 
period is large, say 200 or more. Thus the fluctuations of range error in 
MPI can be followed visually with a fair idea of the relevant probabilities. 

To facilitate further the continuing comparison of means of range 
errors, a yardstick is provided at the left for the difference between the 
means of two lots, a longer yardstick for the difference between the means 
of two sights, and a still longer yardstick for the difference between the 
means of two flights. The vertical distance between points representing 
means of any of these types should exceed the length of the corresponding 
yardstick about once in 20 times if chance alone is operating. The 
means for the sights are not shown but are easily inferred as the centers 
of gravity of the two points representing the flights with the individual 
sight. A visual appraisal of the distance between these centers of gravity 
for the two sights representing a lot, in comparison with the middle yard¬ 
stick, will usually indicate clearly whether the difference in performance 
in this respect is unusually great. An excessive difference should be 
regarded as a tentative indication of some variation of conditions at the 
factory during the production of the lot represented by this sample. 
However, it might also be explained by a difference in weather or personnel 
conditions between the two days on which one sight was tested 
and the two days on which the other sight was tested; or by chance alone, 
once for every twenty such comparisons if these were to be made for all pairs. 
The yardstick lengths on the $\bar{r}$ chart are 9.47, 6.70, and 4.74 mils for 
flights, sights, and lots, respectively. 

Fig. 3.2.—Control chart for mean deflection $\bar{d}$. 

A fallacy that must be guarded against in connection with the yard¬ 
sticks is a tendency to select the greatest and the least of a series of obser¬ 
vations and to regard them as significantly different if their heights 
differ by more than the length of the yardstick. This is wrong and 
likely to lead to far too many judgments of significant difference, because 
the yardstick lengths are determined on the assumption that they will 
be used for comparing only observations selected in a manner independent 
of their magnitudes. It is perfectly legitimate to use the yardstick 
to compare consecutive observations, or the first and last of a considerable 
series, though even so it must be remembered that significance will 
emerge fallaciously about once in 20 times. The detection of trends 
in quality should be facilitated by applying the yardsticks to observa¬ 
tions a considerable distance apart. Discontinuous changes may become 
evident when two consecutive observations differ by an amount approxi¬ 
mating the length of the yardstick, provided earlier observations are 
fairly well in accord with the first of the pair and later ones with the 
second. It is possible to devise exact statistical criteria for both trends 
and discontinuities, but workers in quality control stress the value of 
graphic devices. 
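
The fallacy described above can be illustrated by a small simulation. This is a sketch under assumptions not taken from the text: the plotted values are treated as independent normal variates with a known standard deviation of 1, a series of 16 values is used, and the yardstick is set at the length that a difference between two values chosen in advance exceeds once in 20 times.

```python
import math
import random

Z_975 = 1.959964  # two-sided 5 percent point of the standard normal


def yardstick(sigma):
    """Length exceeded by |x - y| once in 20 times for independent N(mu, sigma^2)."""
    return Z_975 * math.sqrt(2.0) * sigma


def exceedance_rates(n_obs=16, trials=20000, seed=1):
    """Share of series in which the yardstick is exceeded, for two ways of choosing."""
    rng = random.Random(seed)
    limit = yardstick(1.0)
    consecutive = 0
    extremes = 0
    for _ in range(trials):
        xs = [rng.gauss(0.0, 1.0) for _ in range(n_obs)]
        # Pair fixed in advance: the first two observations.
        consecutive += abs(xs[1] - xs[0]) > limit
        # Pair picked after looking: the greatest and the least.
        extremes += (max(xs) - min(xs)) > limit
    return consecutive / trials, extremes / trials


pair_rate, extreme_rate = exceedance_rates()
print(f"pair chosen in advance: {pair_rate:.3f}")
print(f"greatest versus least:  {extreme_rate:.3f}")
```

Under these assumptions the pair chosen in advance exceeds the yardstick close to once in 20 times, whereas the greatest and least of the series do so in the large majority of cases, which is why the yardstick must not be applied to observations selected for their magnitudes.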

2.2.3. Control Chart for Mean Deflection.—A control chart for $\bar{d}$ 
(Fig. 3.2) should be kept in exactly the same way as that for $\bar{r}$. However, 
the control limits will be farther apart in the case of the $\bar{d}$ chart 
and the yardsticks longer because the variance of deflection errors is 
greater than that of range errors. The 0.01 control limits are at 8.40 
and -2.79 mils; the 0.05 limits are at 7.05 and -1.44 mils. The yardstick 
lengths are 11.65, 8.24, and 5.83 mils. 

2.2.4. Control Chart for Range Variance.—For each flight the estimated 
variance of range errors is defined by the formula 

$$ s_r^2 = \frac{S(r - \bar{r}_F)^2}{N - 1} $$

but may be found with a slight economy of labor by using the fact that 

$$ S(r - \bar{r}_F)^2 = Sr^2 - \bar{r}_F\,Sr $$
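
The agreement of the two forms can be checked numerically. The sketch below uses hypothetical function names and a made-up flight of four range errors; the divisor is the number of degrees of freedom, N - 1.

```python
def range_variance(errors):
    """Definition: S(r - rbar)^2 / (N - 1)."""
    n = len(errors)
    mean = sum(errors) / n
    return sum((r - mean) ** 2 for r in errors) / (n - 1)


def range_variance_shortcut(errors):
    """Same quantity from Sr^2 - rbar * Sr, without forming the deviations."""
    n = len(errors)
    total = sum(errors)
    mean = total / n
    return (sum(r * r for r in errors) - mean * total) / (n - 1)


flight = [4.0, -2.0, 7.0, 3.0]  # hypothetical range errors in mils
print(range_variance(flight))           # 14.0
print(range_variance_shortcut(flight))  # 14.0
```

Here the mean is 3, the squared deviations sum to 42, and 42/3 = 14. The shortcut gives 78 - 3 x 12 = 42 for the same numerator.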

The control chart for range variance is to be on logarithmically ruled 
paper (Fig. 3.3). Values of $s_r^2$ for the four flights representing each lot 
of bombsights are plotted on the same vertical line, two circles for that 
one of the sample sights of the lot produced earlier, and two crosses for 
the other. The arithmetic mean of these four values of $s_r^2$ is the best 
estimate of range variance for this lot of bombsights. The points representing 
such means (which, because of the logarithmic scale, are slightly 
above the centers of gravity of the points representing individual values 
of $s_r^2$) are connected by a zigzag line. Only a single control limit is 
plotted in this case for each probability considered, since it is not necessary 
to be on guard against unusually small variances. When $N = 4$, 
the variance estimate for a lot has 12 degrees of freedom (3 for each of 
the 4 flights). The lot variance for range that will be exceeded by 
chance once in 20 times is, according to the estimate supplied by 
our preliminary sample, 82.87, and this is represented by the height of 
the control limit marked 0.05 in Fig. 3.3. The corresponding value for 
probability 0.01 is 104.4 (see Sec. 3.3.1 for the origin of these values). 

Fig. 3.3.—Control chart for range variance $s_r^2$. 

It should particularly be observed that the denominator of each of 
the estimates of variance here used equals the number of degrees of 
freedom, not the sample number. Using the sample number, as is 
often done, introduces unnecessary complications into the determination 
of the probabilities, and if applied in the present case is likely to lead to 
large errors. 

Yardsticks having functions similar to those of Figs. 3.1 and 3.2 are 
provided at the left of Fig. 3.3. The logarithmic vertical scale of Fig. 
3.3 makes possible the use of these yardsticks, which are to be regarded 
as subject to rigid translation. The exact comparison of estimates of 
variance in terms of probability must be made by means of a function 
of their ratio, or the difference of their logarithms. This is the reason 
for using a logarithmic chart for $s_r^2$. In the charts for $\bar{r}$ and $\bar{d}$ an arithmetic 
scale was used, since the comparison between two means must 
be made in terms of their difference, not their ratio. The lengths of 
the yardsticks for flights, sights, and lots correspond, respectively, to 
the variance ratios 15.45, 5.82, and 3.28. 

To illustrate the use of these yardsticks, suppose that $s_r^2$ should 
have the value 10 for one lot and 40 for the next. Both points are well 
within the control limits. But the ratio 4 exceeds the yardstick 3.28, 
as is immediately obvious when the points are charted, to such an extent 
that the probability is considerably less than 0.05 of the variances deviat¬ 
ing so far from equality by chance alone. Unless some explanation 
connected with the testing is forthcoming, this is to be regarded as a 
danger signal suggesting deterioration of the sights in a manner leading 
to wider scatter in range. The two sights belonging to the lot showing 
the variance 40 should be examined for sources of excessive range vari¬ 
ance such as loose rate knobs, and any explanation found should be indi¬ 
cated in the space below the control charts reserved for notes. 
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Fig. 3.4.—Control chart for deflection variance sj. 
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2.2.5. Control Chart for Deflection Variance— The variance of deflec¬ 
tion errors is to be treated in exactly the same way, except that the 
control limits are in this case at 125.4 and 157.9. The yardstick lengths 
are the same as for the range-error variance chart (see Fig. 3.4). 
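
The yardstick comparison on the variance charts reduces to a comparison of a ratio with a fixed number, as in the illustration of Sec. 2.2.4 where lot variances of 10 and 40 give a ratio of 4 against a yardstick of 3.28. A minimal sketch follows; the function names are hypothetical and only the three yardstick ratios are taken from the text.

```python
import math

# Variance-ratio yardsticks quoted for the variance charts (probability 0.05).
RATIO_YARDSTICK = {"flights": 15.45, "sights": 5.82, "lots": 3.28}


def exceeds_yardstick(var_a, var_b, level):
    """True when the larger variance over the smaller exceeds the yardstick."""
    ratio = max(var_a, var_b) / min(var_a, var_b)
    return ratio > RATIO_YARDSTICK[level]


def log_distance(var_a, var_b):
    """Vertical separation on a log10 chart; it depends only on the ratio."""
    return abs(math.log10(var_a) - math.log10(var_b))


print(exceeds_yardstick(10.0, 40.0, "lots"))  # True, since 4 > 3.28
print(log_distance(10.0, 40.0), log_distance(25.0, 100.0))
```

Because the separation on the logarithmic scale depends only on the ratio, the same yardstick can be slid to any height on the chart, which is the rigid translation mentioned above.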

2.2.6. Control Charts Combining Range with Deflection. —The indi¬ 
cators of quality to be used combining range and deflection errors are 
denoted by T with the following subscripts: 

O for an over-all measure 

M for a measure based only on the mean point of impact (MPI) 

D for a measure based on deviations from the MPI 

A second subscript will also be used when needed to make the meaning 
clear, as follows: 

B for an individual bomb 
F for all the bombs dropped in a flight 
S for the bombs dropped with the same sight 
L for the bombs dropped with the sights representing a lot 

It will frequently be convenient to omit the second subscript. 

The definitions of the various T's all make use of the same coefficients 
$a$, $b$, $c$, whose values may be taken at first, on the basis of the 
experience analyzed in Sec. 5, as follows: 

a = 0.021 724 b = 0.001 331 c = 0.014 360 

However, these values, and the corresponding control-chart limits, should 
be revised periodically as further experience accumulates. They will 
especially need to be revised after any change in technique of inspection 
has been in effect long enough to provide a sufficient basis in experience. 
Methods of revising them, and the other numerical quantities used in 
this part, are described in Sec. 3.1.3. 

$T_O^2$ is defined for an individual bomb as 

$$ T_{OB}^2 = ar^2 + 2brd + cd^2 $$

and for a flight, sight, or lot as the sum of the values of $T_{OB}^2$ for the individual 
bombs involved. However, it will only be in exceptional cases 
that $T_{OB}^2$ will actually be computed, and ordinarily it will be convenient 
to compute $T_{OF}^2$ by the formula 

$$ T_{OF}^2 = aSr^2 + 2bSrd + cSd^2 $$

where, as before, $S$ stands for summation over all the bombs dropped on 
the flight. For a sight or a lot the value of $T_O^2$ is found by adding together 
its values for the several flights involved. 

Fig. 3.5.—Control chart for error in mean point of impact $T_M$. 

$T_M^2$ might be defined for a flight by the formula 

$$ T_{MF}^2 = N(a\bar{r}_F^2 + 2b\bar{r}_F\bar{d}_F + c\bar{d}_F^2) $$

but is more easily computed from the equivalent formula 

$$ T_{MF}^2 = a\bar{r}_F\,Sr + 2b\bar{r}_F\,Sd + c\bar{d}_F\,Sd $$

For a sight or a lot the value of $T_M^2$, like that of $T_O^2$, is found by adding 
up the values for the flights involved. 
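
The two pairs of formulas above can be checked against each other on a single flight. The sketch below uses the initial coefficients quoted in this section; the function names and the four bomb coordinates are hypothetical.

```python
A, B, C = 0.021724, 0.001331, 0.014360  # initial coefficients quoted above


def t_o_sq_bomb(r, d):
    """Over-all measure for one bomb: a r^2 + 2 b r d + c d^2."""
    return A * r * r + 2 * B * r * d + C * d * d


def t_o_sq_flight(bombs):
    """Over-all measure for a flight from the sums Sr^2, Srd, Sd^2."""
    s_rr = sum(r * r for r, _ in bombs)
    s_rd = sum(r * d for r, d in bombs)
    s_dd = sum(d * d for _, d in bombs)
    return A * s_rr + 2 * B * s_rd + C * s_dd


def t_m_sq_flight(bombs):
    """MPI measure for a flight, by the defining and the shortcut formulas."""
    n = len(bombs)
    s_r = sum(r for r, _ in bombs)
    s_d = sum(d for _, d in bombs)
    r_bar, d_bar = s_r / n, s_d / n
    by_definition = n * (A * r_bar ** 2 + 2 * B * r_bar * d_bar + C * d_bar ** 2)
    by_shortcut = A * r_bar * s_r + 2 * B * r_bar * s_d + C * d_bar * s_d
    return by_definition, by_shortcut


# Hypothetical (range, deflection) errors in mils for one flight of four bombs.
flight = [(4.0, -2.0), (-3.0, 5.0), (6.0, 1.0), (1.0, 8.0)]
print(t_o_sq_flight(flight), sum(t_o_sq_bomb(r, d) for r, d in flight))
print(t_m_sq_flight(flight))
```

In each printed pair the two numbers agree to rounding error, so the flight sums give the same result as the bomb-by-bomb definition.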

In every case $T_O^2$ is greater than $T_M^2$. The difference is defined as 
$T_D^2$. An algebraic identity shows that 

$$ T_{DF}^2 = T_{OF}^2 - T_{MF}^2 = aS(r - \bar{r}_F)^2 + 2bS(r - \bar{r}_F)(d - \bar{d}_F) + cS(d - \bar{d}_F)^2 $$

while for a sight or a lot the value of $T_D^2$ is the sum of those for the flights 
involved. Thus in each case $T_D^2$ depends only on the deviations from 
the mean points of impact for the respective flights, whereas $T_M^2$ depends 
only on these mean points of impact. 
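
The identity can be verified numerically, flight by flight, and the additivity over flights then gives the values for a lot. The following is a sketch with hypothetical names and made-up coordinates; it computes $T_D^2$ directly from the deviations about each flight's MPI and compares it with the difference of the other two measures.

```python
A, B, C = 0.021724, 0.001331, 0.014360  # initial coefficients from Sec. 2.2.6


def quad(r, d):
    """The quadratic form a r^2 + 2 b r d + c d^2."""
    return A * r * r + 2 * B * r * d + C * d * d


def decompose_flight(bombs):
    """Return (T_O^2, T_M^2, T_D^2) for one flight of (r, d) pairs."""
    n = len(bombs)
    r_bar = sum(r for r, _ in bombs) / n
    d_bar = sum(d for _, d in bombs) / n
    t_o = sum(quad(r, d) for r, d in bombs)
    t_m = n * quad(r_bar, d_bar)
    t_d = sum(quad(r - r_bar, d - d_bar) for r, d in bombs)
    return t_o, t_m, t_d


def decompose_lot(flights):
    """Add the flight values to obtain the three measures for a lot."""
    parts = [decompose_flight(f) for f in flights]
    return tuple(sum(p[i] for p in parts) for i in range(3))


# Hypothetical lot: four flights of four bombs each, errors in mils.
lot = [
    [(4.0, -2.0), (-3.0, 5.0), (6.0, 1.0), (1.0, 8.0)],
    [(0.5, 3.0), (2.0, -1.0), (-4.0, 6.0), (3.5, 2.0)],
    [(7.0, 4.0), (5.0, 9.0), (1.0, 0.0), (-2.0, 3.0)],
    [(2.0, 2.0), (-1.0, 7.0), (0.0, -3.0), (6.0, 5.0)],
]
t_o, t_m, t_d = decompose_lot(lot)
print(t_o, t_m + t_d)
```

The two printed numbers agree to rounding error. With these coefficients the quadratic form is positive definite, since ac exceeds b squared, so neither part can be negative.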

The direct use of $T_M$ and $T_O$ will be appropriate only when biases 
such as those disclosed by the statistical analysis of Sec. 5 have either 
been eliminated or definitely attributed to the bombsight itself rather 
than anything in the testing process. Until such elimination is achieved 
or such a judgment is reached, it is better to modify $T_M$ and $T_O$ so as to 
make them functions of deviations, not from the target, but from the 
mean point of impact of the bombs of the base period. Indicating this 
correction by placing a caret over $T$, and using $\bar{r}'$ and $\bar{d}'$ for the coordinates 
of the mean point of impact of the bombs of the base period, gives 

$$ \hat{T}_{OF}^2 = \frac{aS(r - \bar{r}')^2 + 2bS(r - \bar{r}')(d - \bar{d}') + cS(d - \bar{d}')^2}{1 + \dfrac{1}{n'}} = \frac{T_{OF}^2 + U}{1 + \dfrac{1}{n'}} $$


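$$ \hat{T}_{MF}^2 = \frac{N[a(\bar{r}_F - \bar{r}')^2 + 2b(\bar{r}_F - \bar{r}')(\bar{d}_F - \bar{d}') + c(\bar{d}_F - \bar{d}')^2]}{1 + \dfrac{N}{n'}} = \frac{T_{MF}^2 + U}{1 + \dfrac{N}{n'}} $$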


where $n'$ is the number of bombs in the base period and 

$$ U = N(a\bar{r}'^2 + 2b\bar{r}'\bar{d}' + c\bar{d}'^2) - 2[a\bar{r}'\,Sr + b(\bar{r}'\,Sd + \bar{d}'\,Sr) + c\bar{d}'\,Sd] $$

The numerical values obtained from the 260 bombs discussed in Sec. 5 
give, when $N = 4$, 

$$ U = 0.64917 - 0.065903\,Sr - 0.084129\,Sd $$
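
The numerical form of U can be approximately recovered from quantities already quoted in this section. The sketch below rests on an assumption that should be stated plainly: the base-period MPI is not given here, so it is taken as the midpoint of the control limits on the mean range and mean deflection charts (about 1.345 and 2.805 mils), since those limits are said to be centered on it. The function names are hypothetical, and the result agrees with the printed coefficients only to the accuracy that two-decimal limits allow.

```python
A, B, C = 0.021724, 0.001331, 0.014360  # initial coefficients from Sec. 2.2.6
N = 4                                   # bombs per flight

# ASSUMPTION: base-period MPI inferred as the midpoint of the quoted 0.01 limits.
R_BASE = (5.89 + -3.20) / 2  # about 1.345 mils
D_BASE = (8.40 + -2.79) / 2  # about 2.805 mils


def u_coefficients(r0, d0, n=N):
    """Constant term and the multipliers of Sr and Sd in U."""
    const = n * (A * r0 * r0 + 2 * B * r0 * d0 + C * d0 * d0)
    coef_sr = -2 * (A * r0 + B * d0)
    coef_sd = -2 * (B * r0 + C * d0)
    return const, coef_sr, coef_sd


def corrected_t_o_sq(bombs, n_base=260, r0=R_BASE, d0=D_BASE):
    """Corrected over-all measure for a flight: (T_OF^2 + U) / (1 + 1/n')."""
    const, coef_sr, coef_sd = u_coefficients(r0, d0, len(bombs))
    s_r = sum(r for r, _ in bombs)
    s_d = sum(d for _, d in bombs)
    t_o = sum(A * r * r + 2 * B * r * d + C * d * d for r, d in bombs)
    u = const + coef_sr * s_r + coef_sd * s_d
    return (t_o + u) / (1 + 1 / n_base)


print(u_coefficients(R_BASE, D_BASE))
print(corrected_t_o_sq([(4.0, -2.0), (-3.0, 5.0), (6.0, 1.0), (1.0, 8.0)]))
```

The three coefficients come out close to 0.649, -0.0659, and -0.0841, in reasonable agreement with the printed values.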




Fig. 3.6.—Control chart for dispersion about mean point of impact $T_D$. 

For a sight or a lot, corrected values $\hat{T}_O^2$ or $\hat{T}_M^2$ are found by adding 
together the corresponding corrected values for the flights involved. 
Values of T D are not affected by the correction. 

For $T_M^2$, $T_D^2$, and $T_O^2$, control charts are to be maintained on
arithlog paper in a form similar to those for $s_r^2$ and $s_d^2$ (see
Figs. 3.5, 3.6, and 3.7). However, there is one difference in form: The
zigzag line on each $T$ chart plots the sums of the quantities represented
by circles and crosses, instead of the means as on the $s^2$ charts. This
results in all the quantities plotted being of kinds customary in
statistical theory, and hence easily looked up in probability tables, at
least to an approximation. A further advantage of this arrangement is that
it renders the appearance of the $T$ charts so unlike that of the $s$
charts that they cannot be confused.

On each of the $T$ charts there is for each of the probabilities 0.05 and
0.01 an upper but not a lower control limit. The three yardsticks supplied
in each case refer to differences, on a logarithmic scale, of sums for
flights, sights, and lots, respectively, with probability 0.05. The values
of $T^2$ for the control limits and yardsticks are given in the following
table:



                          $T_M^2$ or $\hat T_M^2$    $T_D^2$    $T_O^2$ or $\hat T_O^2$

0.05 control limit               16.3               41.2             54.4
0.01 control limit               21.2               49.4             64.0
0.05 yardsticks:
    Flights                      39.0                5.8              4.4
    Sights                        9.6                3.3              2.8
    Lots                          4.4                2.3              2.0


These values require a small correction of a purely mathematical 
kind based on research not yet completed. 
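
A minimal Python sketch of how the tabulated control limits might be applied
to the lot totals plotted on the zigzag lines is given below. It assumes that
the three columns of the table refer to $T_M^2$, $T_D^2$, and $T_O^2$ in that
order; the function name and the wording of the verdicts are inventions of
this sketch. The limits are the uncorrected values quoted in the table.

```python
# Control limits for lot totals, copied from the table above.
# Assumption of this sketch: the columns are T_M^2, T_D^2, T_O^2 in order.
LIMITS = {
    "T_M": {"0.05": 16.3, "0.01": 21.2},
    "T_D": {"0.05": 41.2, "0.01": 49.4},
    "T_O": {"0.05": 54.4, "0.01": 64.0},
}

def classify(measure, lot_total):
    """Compare a lot total of T^2 with its upper control limits.

    Only upper limits exist on the T charts, so small values are never
    flagged.
    """
    limits = LIMITS[measure]
    if lot_total > limits["0.01"]:
        return "beyond 0.01 limit"
    if lot_total > limits["0.05"]:
        return "beyond 0.05 limit"
    return "within limits"

# Hypothetical lot totals, for illustration only.
for measure, value in [("T_M", 17.0), ("T_D", 30.0), ("T_O", 70.0)]:
    print(measure, value, classify(measure, value))
```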

2.2.7. Illustration of the Use of Control Charts.*—A glance at Fig. 3.1 
shows that all but two of the lots were well within the control limits for 
range MPI. Lot 9 falls slightly outside the 0.05 limit at what may be 
estimated as the 0.04 level. Since there is a reasonably large chance of 
1 out of 15 or 16 cases exceeding the 0.05 limit, and since lot 9 shows no 
shortcomings on the other charts, this deviation may be disregarded. 
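
The size of this chance is easily worked out. The short Python sketch below
assumes that the lots are statistically independent and that each exceeds the
0.05 limit with probability exactly 0.05 when nothing is wrong; both
assumptions are idealizations introduced for the illustration.

```python
def prob_at_least_one(p, k):
    """Chance that at least one of k independent lots exceeds a limit
    which each lot exceeds with probability p."""
    return 1 - (1 - p) ** k

for lots in (15, 16):
    print(lots, "lots:", round(prob_at_least_one(0.05, lots), 3))
```

Under these assumptions the chance is somewhat better than even, so that a
single mild excursion among 15 or 16 lots is no cause for alarm.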

A very different situation is presented by lot 2. As Fig. 3.1 shows, the 
MPI was in this case an average of nearly 6 mils short, a distance far 
beyond even the 0.01 control limit. Following this lot through the other 
control charts, we observe from Fig. 3.2 that its deflection error in MPI 
was very moderate, and from Fig. 3.4 that the deflection variance was 
exceptionally small. From Fig. 3.3 it appears that the range variance
was far below the average. Moreover, in Fig. 3.3 the circles and crosses
for lot 2 are all remarkably close together, indicating consistently low
variance, that is, high accuracy in hitting closely around a point 6 mils
short of the target. Figure 3.5 shows that the range error in MPI is too
great to be concealed by averaging with the small deflection error. Upon
looking at Fig. 3.6, it is apparent that lot 2 yielded the smallest value of
$T_D$ of all the 16 lots plotted, thus demonstrating a rather remarkable
accuracy in hitting the wrong point. The error in MPI and the close
clustering about it offset each other sufficiently so that the bombs of lot 2
are only moderately far from the target in terms of $T_O$, so that Fig. 3.7
shows nothing remarkable.

* This section is for illustrative purposes only and is based on data put together
artificially, as explained in the footnote in Sec. 2.2.1 and in Sec. 5.

Fig. 3.7. Control chart for over-all error, $T_O$.

This combination of peculiarities might be brought about by a 
bombardier with very quick reactions, enabling him to keep the airplane 
on a remarkably good course and to maintain regularity in his behavior, 
but causing him to release the bomb consistently too soon. Such a man 
should be capable of development into an exceptionally accurate bombardier
if he studied the control charts and particularly his own errors.

2.3. Action When Results Are Bad 

If in any case the zigzag line crosses its 1 percent control limit, an 
investigation should be made of the cause of this anomaly, and the 
probable cause entered in the space provided for the purpose at the 
bottom of the control chart. If the investigation includes further air 
testing following shop adjustment, the same seven quantities should be 
computed for the new air tests and their significance judged by means of 
the probability formulas of Sec. 3. Entries of probable cause should 
indicate clearly whether the excessive bombing errors bringing about the 
anomaly are believed to indicate a fault in the sight or in some feature 
of the testing process. Pure chance may be blamed approximately once 
for every hundred lots from which sample sights are tested; this allotment 
of 1 anomaly in 100 applies separately to each of the control charts. 

Some clues as to the nature of the trouble will be provided by an
intelligent reading of the control chart itself, with attention to the
consistency of performance of the two sights in a lot and of the two
flights with each sight, and by observing whether the trouble is in range
or deflection, or both, and whether it introduces a bias into the MPI or
increases the dispersion about the MPI. To facilitate such inquiries, a
list of the types of defect likely to lead to excessive deviations in each
of the seven control-chart quantities may be used (see that below, p. 140).

If the 0.01 control limits are exceeded repeatedly, the investigation 
should be extended to the factory. Intensified shop inspection should 
also be applied. In extreme cases the remaining sights in a lot represented
by defective sights should be recalled or repaired. If this happens,
the shipment of further sights prior to completion of air testing of samples
should be suspended until the factory has again established a satisfactory 
record; and the factory should revert to the status of a new factory, which 
implies that 40 consecutive sights should successfully pass the air test. 
Exact criteria as to when this should be done will need to be formulated in 
the future on the basis of further experience. 

When the rate of production of bombsights is to be reduced, the 
average showing of a factory on the control charts should be taken into 
consideration, in comparison with that of other factories. The average 
value of $T_O^2$ is probably the best criterion that can be drawn from the
coordinates of the bombs dropped in air testing for deciding which factory 
should be eliminated. 

The control chart should not, however, be looked upon primarily with
a view to drastic action. Its great value, demonstrated in other
industries, is in detecting trends in quality and calling attention to
deterioration while it is still incipient and before it becomes serious.
When this value is recognized, the attitude of manufacturers changes from
hostility to appreciation. In order that these desirable results be
realized, it is essential that copies of control charts be transmitted
promptly to manufacturers, and that attention be called to peculiarities
revealed by them. Such peculiarities include not only the crossings of the
0.01 limit that call for investigations or more serious action, but also
cases of exceeding the 0.05 limit more than about once in 20 times, or of
progressive drifting up toward the control limits, or of excessive
variation between the performances of the same sight or of two sights from
the same lot, as indicated by the yardsticks. The meaning of each of the
seven control-chart indicators should be made clear to all concerned.

The yardsticks are designed to facilitate visual discernment of tendencies
to drift before the control limits are reached and also to help in
observing abrupt changes. A little practice and familiarity with this
type of chart should make it possible to detect such changes more
promptly than would otherwise be easy. Prompt detection of trends
and anomalous tendencies can be of great value if the factory is notified 
at once, since trouble can often be found and corrected in this way before 
it becomes serious. It is to the interest of the government and the 
manufacturer alike to prevent serious trouble by early corrective action 
of this sort, rather than to wait for it to develop to the point at which 
bombsights will be rejected. 

Of all the indicators, $T_O$ is the one likely to give the sharpest
discrimination between cases in which sights are really defective or the
bombing for any reason poor, and on the other hand cases of pure bad
luck. It combines the evidence from all the others and can serve as the
deciding agent in marginal cases in which the others seem to disagree on
the question whether the bombing was good or bad. In a more specialized
way, $T_M$ combines the evidence of $\bar r$ and $\bar d$, while $T_D$
combines that of $s_r$ and $s_d$. It is for diagnosis of the trouble rather
than for evidence of its existence that the main reliance should be placed
on the charts for means and variances.

Investigations prompted by excessive control-chart deviations will
be assisted by a list of possible causes under each of the seven
control-chart measures. Such a list is best drawn up by persons in continuing
close relation to the manufacture, testing, and use of the sights and may 
need modifications from time to time with changes of model and in the 
testing procedure. The list below is purely tentative and needs to be 
supplemented by further suggestions drawn from personnel concerned 
with manufacture, shop testing, air testing, and maintenance. 

In order that such suggestions be made intelligently, it is necessary
that the personnel involved have some general understanding of the
meaning of each of the seven measures. The following definitions may
be sufficient for most of those concerned:

$\bar r$   Range coordinate of mean point of impact
$\bar d$   Deflection coordinate of mean point of impact
$s_r$      A measure of dispersion in range
$s_d$      A measure of dispersion in deflection
$T_M$      A measure of distance from mean point of impact to target,
           modified in accordance with probability principles to allow for
           the greater deflection variance
$T_D$      A measure of dispersion about the MPI combining range and
           deflection errors
$T_O$      An over-all measure of errors of all kinds

It should be understood that each of the $T$ measures combines range
with deflection errors, and that $T_O$ combines all the other measures into
one. If a particular defect in a sight, or in the testing process, can affect
only range errors, it is more likely to be detected through $\bar r$ or
$s_r$ than through any of the measures that include deflection errors. In
the same way, if a particular defect can affect only deflection errors, it
is more likely to be found from the $\bar d$ or $s_d$ chart than from any
of the $T$ charts. However, any defect that causes both range and
deflection errors may sometimes be detected through one of the combined $T$
measures when it would be impossible to distinguish it from mere chance on
the basis of the charts of range or deflection errors alone. Similarly, any
cause affecting both MPI and variance can sometimes be brought out clearly
by $T_O$ when none of the others would make it definite.

In making up such a list, it appears better to consider the various 
possible sources of trouble and mark each with one or more of the seven
symbols, later rearranging to get lists in the form below, than to begin
with the seven control-chart measures and try to think of all the possible 
causes. 

The following partial and tentative list should be understood in the 
terms of the preceding remarks. Under each control-chart measure are 
listed several causes, any one of which might lead to an excessive value 
for the measure. Each factor is listed only under the measure or measures
most likely to lead to its detection, though it may also have minor
effects on other measures and will be reflected in more comprehensive 
measures. 


$\bar r$

Grease on disk
Worn disk
Faulty trail plate
Release mechanism slow or fast
Bombardier's reaction slow or fast
Loose telescope cable
Faulty sight gyro
Wrong disk speed
Loose altitude knob
Slipping of scale on altitude drum
Faulty or wrongly adjusted telescope drive governor

$\bar d$

Faulty directional gyro
Stabilized sector off center
Pilot director scale displaced
Pilot director rheostat inaccurate
Pilot director brush worn
Crosstrail mechanism inaccurate

$s_r$

Erratic release mechanism
Erratic behavior of bombardier
Grease on disk
Roller or disk not smooth
Roller not accurately centered
Loose telescope cable
Loose range knobs
Loose altitude knob
Faulty sight gyro
Faulty telescope drive motor

$s_d$

Loose course knobs
Faulty directional gyro
Stabilized sector off center
Slow reaction by airplane
Slow reaction by pilot

$T_M$

Any combination of factors affecting $\bar r$ and $\bar d$
Wrong altitude
Wrong air speed
Faulty direction of optical axis of telescope
Sight gyro inaccurately leveled, perhaps because of inaccurate bubbles

$T_D$

Any combination of factors affecting $s_r$ and $s_d$
Loose altitude knob
Sticky bearings
Variable winds
Poor visibility
Bumpy air

$T_O$

Any combination of factors affecting two or more of the other measures
General slipshodness or lack of skill in either manufacture or testing
Variable winds
Faulty sight gyro

2.4. Experimentation 

This plan may readily be adapted for efficient combination with 
experiments for determining the relative excellence of different models 
of sights and types of bombing, or for investigating other questions
relating to bombing. Considerable economy in the total amount of bombing
required should be obtainable by a proper design for experimenting 
simultaneously on various types of equipment and technique while at 
the same time carrying out the routine test bombing with full accuracy. 
The amount of bombing required will be greater than for the routine 
testing alone or for the experiment alone, but less than the sum of the 
two. 

To ensure valid results from such combined enterprises, it is important 
to verify before the experiment is carried out that it satisfies the criteria 
of the modern statistical theory of design of experiments. It may be 
difficult or impossible to salvage any useful information from the results
of an improperly designed experiment. Many designs are possible for 
efficient combined experiments. The selection of a particular design is 
best made in relation to the objects of a particular experiment. 

As an illustration of a simple type of combination of experiment
with routine testing, suppose that some new type of auxiliary equipment
is to be examined for its effect (1) on the range component $\bar r$ of
mean point of impact, (2) on the standard deviation $s_r$ in range, and
(3) on general accuracy. Instead of two flights per sight,
four flights might now be made for a time with each sight. Two of
these flights are to be on one day of a 4-day period and two on another.
Of the two flights with a sight on the same day, one, chosen by tossing a
coin, is to carry the new equipment and the other is not. A valid analysis
of variance of range can be carried out from the results to answer
question (1), and analogous procedures to answer (2) and (3) by means
of $s_r$ and $T_O$, respectively, though the analysis for $T_O$ must either
involve approximations or wait for mathematical research on the subject to
progress a little further than the point heretofore reached. The accuracy
of determination of the differences made by the new equipment will be
fully as great as if the entire enterprise were devoted wholly to this
question. From this standpoint the information relevant to the routine
testing program is obtained free of charge.
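
The randomization just described might be sketched in Python as follows. The
text specifies only the coin toss between the two flights made with a sight
on the same day; drawing the two days at random from the 4-day period, the
sight labels, and the record layout are assumptions of this sketch.

```python
import random

def assign_flights(sights, days=(1, 2, 3, 4), rng=None):
    """Lay out four flights per sight: two on each of two days.

    On each day a fair coin decides which of the two flights carries the
    new equipment. Choosing the two days at random is an assumption of
    this sketch, not a requirement stated in the text.
    """
    rng = rng or random.Random()
    plan = []
    for sight in sights:
        chosen_days = sorted(rng.sample(days, 2))
        for day in chosen_days:
            with_new = rng.choice((1, 2))   # the coin toss
            for flight in (1, 2):
                plan.append({
                    "sight": sight,
                    "day": day,
                    "flight": flight,
                    "new_equipment": flight == with_new,
                })
    return plan

# Hypothetical sight labels; a seeded generator makes the plan repeatable.
PLAN = assign_flights(["sight A", "sight B"], rng=random.Random(0))
for row in PLAN:
    print(row)
```

Recording the seed, or the tosses themselves, allows the assignment to be
audited afterward, which is the practical meaning of objective randomization.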

For the rating of men and equipment routinely employed in the
testing of bombsights, it is possible to utilize the results without any
modification of design other than objective randomization in the
assignments of the various flights. This randomization should be restricted
by conditions of balance and orthogonality in order to have the most
accurate ratings possible and to simplify calculations, but these
conditions are not essential. One simple type of rating would be merely the
average of the values of $T_O^2$ for the flights involved. However, new
types of equipment or personnel not used in the base period from
which the control limits were determined will invalidate these control
limits and should not be introduced into the air testing of bombsights
without a suitable modification of design. Similarly, altitude and other
conditions attending the test bombing should be kept as nearly as possible
the same as in the base period until a deliberate decision is reached that a
change should be made and maintained over a considerable period. In
such an event, it will be necessary to begin over again the process of
accumulating data for new control-chart limits.

3. FORMULAS AND CONSIDERATIONS AFFECTING REVISIONS OF THE PLAN

3.1. General Considerations 

3.1.1. Occasions for Revision. New information and changing conditions
will from time to time call for revisions of the sampling plan specified
in Sec. 2. Among the changes likely to be appropriate are alterations 
in the size of lots, the proportion of a lot that is to be air-tested, the number 
of flights with each sight, the number of bombs dropped per flight, and 
the estimates of the variances and covariances of range and deflection 
errors from which control limits and yardsticks are calculated. 

Another kind of change that may be developed is the use of improved 
types of control charts for shop inspection, together with the integration 
of shop-inspection procedures with air testing in a more systematic way. 
For such improvements the same statistical methods that are here 
recommended for air testing may be applied to shop testing, or to a 
combination of shop and air testing. Such a development would mean 
that instead of the two types of measurement, range and deflection 
errors, it would be necessary to take account of a larger number of 
measures on each sight and to make appropriate combinations of the 
various measures. The mathematical methods of multivariate statistical 
analysis are capable of extension with a little research to much more 
general situations involving coordinated testing at different stages. 
For this reason, and for the sake of other analogous applications, certain 
formulas will attain greater generality by being written in a form suitable 
for ready extension to more than two variables. Consequently, $x_1$ and
$x_2$ will sometimes be used instead of $r$ and $d$ for range and
deflection errors, and subscripts will be used which take only the values
1 and 2 in the present chapter but which may be understood to take other
values as well in cases in which more measurements are made. For example,
in place of the $a$, $b$, and $c$ introduced in Sec. 2.2.6, when generality
is sought $l_{11}$, $l_{12}$, and $l_{22}$, respectively, will be used, so
that such a formula as

$$T^2 = ar^2 + 2brd + cd^2$$

may also be written

$$T^2 = l_{11}x_1^2 + 2l_{12}x_1x_2 + l_{22}x_2^2$$

or, more compactly (with $l_{21} = l_{12}$),

$$T^2 = \sum\sum l_{ij}x_ix_j$$

with the understanding that the first $\sum$ stands for summation with
respect to $i$ and the second $\sum$ with respect to $j$. In this chapter,
the summations
will be over the values 1 and 2 only, but in the extensions envisaged 
they may be over any set of values. Our results will be in such forms as 
to be readily extended to the more general cases. 
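
The compact form lends itself directly to computation for any number of
measures. In the Python sketch below the function name, the coefficients, and
the measurements are all hypothetical; the matrix of coefficients is taken to
be symmetric, so that the two-variable case reduces to the formula in $a$,
$b$, and $c$.

```python
def t_squared(coeff, x):
    """Double sum of coeff[i][j] * x[i] * x[j] over all i and j."""
    p = len(x)
    return sum(coeff[i][j] * x[i] * x[j] for i in range(p) for j in range(p))

# Two measures: range and deflection errors (hypothetical numbers).
a, b, c = 0.05, 0.01, 0.03
r, d = 2.0, -1.0
two_var = t_squared([[a, b], [b, c]], [r, d])

# Three measures: the same function serves without change.
coeff3 = [[0.05, 0.01, 0.00],
          [0.01, 0.03, 0.02],
          [0.00, 0.02, 0.04]]
three_var = t_squared(coeff3, [2.0, -1.0, 0.5])
print(two_var, three_var)
```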

Still another type of revision of this and other sampling plans is 
likely to be required for sequential analysis, by which the decision as to 
the number of tests to be made is not fixed at the beginning of the testing 
but is postponed and determined by the gradually accumulating data. 
Considerable economies are possible by use of a well-worked-out system 
of sequential testing. Methods of applying sequential analysis to the
particular problem of this chapter have not yet been developed.* 

3.1.2. Numbers of Sights, Flights, and Bombs. —In the plan suggested 
in Sec. 2, the proportion of sights to be sent to the proving ground was
taken as one-tenth simply because this has been the proportion sent in the 
past and because the shop-testing facilities appear to be adequate for 
dealing with this proportion. A continuing rating of the inspection at the 
factories in comparison with the results of the shop and air tests at the 
proving ground, and a consideration of the needs of the service, might lead 
to the decision to send either a larger or a smaller proportion of sights to 
the proving ground. If it should be decided to increase the proportion of 
sights sent to 15 or 20 percent of the total, this could be accomplished by 
sending 3 or 4 sights from each lot of 20 with a minimum of disturbance of 
the sampling plan. If, on the other hand, it should be decided that the 
inspection at the factory is more adequate than has been felt to be the case 
in the past, and a smaller proportion of sights is to be sent, the best way 
to do this would be to increase the size of the lot. It is desirable to 
have the lot large enough so that, with the proportion adopted, at least 
two sights chosen at random from each lot shall be sent to the proving 
ground. The reason for this is that there should be available a continuing
measure of variation within lots, and this requires at least two
sights from each lot. Such variation is displayed by the control-chart 
scheme of Sec. 2 along with the measures of average quality. 
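
The arithmetic connecting the proportion sampled with the size of lot may be
sketched as follows. The function names are inventions of this sketch, and
exact fractions are used only to avoid rounding trouble.

```python
import math
from fractions import Fraction

def min_lot_size(proportion, min_sights=2):
    """Smallest lot from which the given proportion yields min_sights."""
    return math.ceil(min_sights / Fraction(proportion))

def sights_from_lot(lot_size, proportion):
    """Whole number of sights sent from a lot at the given proportion."""
    return math.floor(lot_size * Fraction(proportion))

print(min_lot_size("1/10"))          # one-tenth, the proportion of Sec. 2
print(sights_from_lot(20, "3/20"))   # 15 percent of a lot of 20
print(sights_from_lot(20, "1/5"))    # 20 percent of a lot of 20
print(min_lot_size("1/20"))          # a smaller proportion needs larger lots
```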

If the number of sights taken from each lot should become larger,
measures of this intra-lot variation could be calculated easily and
accurately, but the graphic portrayal by circles and crosses might not be so
clear, and vertical segments proportional to calculated measures of
variation might need to be used instead. If collateral experimentation
on methods of bombing, auxiliary equipment, and so forth, is to be
combined efficiently with test bombing, there is an advantage in having more
than two sights chosen at random from each lot, as the larger numbers
make possible the design of more efficient experiments. Against this
consideration is to be set the fact that if the lots are large, as would be
implied by having a large sample constituting a fixed percentage of a lot,
there might be an excessive delay at the factory in the accumulation of
enough sights to make a lot so that the random-sampling process could
be applied. This objection to large lots would apply particularly where
the rate of production of a factory is rather small. This matter is
touched upon further in Sec. 4.2. The proportion of sights received at
the proving ground to be air-tested has been arrived at not by calculation
but somewhat arbitrarily from a general survey of the practices
prevailing in the past and the opinions of those most closely in touch
with various phases of the situation.

* See Chap. 17 for a brief account of sequential analysis. For fuller accounts see,
on applications, Statistical Research Group, Sequential Analysis, and on theory, Wald,
“Sequential Tests.”

It is essential that at least two flights be made with each sight in 
order that a measure of variation due to the weather and chance factors 
may be available. The number of bombs dropped per flight has been 
taken to be four, because that was the most commonly used number in the 
period just before this study was begun. However, since six ore-filled 
test bombs can be dropped on a flight and since additional accuracy is 
gained by this practice, it may prove desirable to modify the plan to 
fit this or some other number N of bombs dropped on a flight. Increased 
accuracy can also be obtained by increasing the number of flights per 
sight. In the light of data accumulating under the sampling plan, the 
question of what combination of numbers of flights per sight and of 
bombs per flight gives the most economical method of obtaining the 
requisite accuracy needs to be investigated from time to time; or (what 
is somewhat the same thing) what combination gives the maximum 
accuracy obtainable with the resources available. Such a combination 
can only be arrived at by a study of comparative costs in conjunction 
with the relation of variance between flights to variance between bombs 
on the same flight. 
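
One conventional way of framing such a study is sketched below in Python. It
rests on assumptions that are not part of the plan itself: that the variance
of a sight's mean error is the between-flight variance divided by the number
of flights, plus the within-flight variance divided by the total number of
bombs; that cost is a fixed charge per flight plus a charge per bomb; and
that at most six bombs can be dropped on a flight. All numerical values are
hypothetical.

```python
def variance_of_mean(flights, bombs, var_between, var_within):
    """Variance of a sight's mean error under a two-stage model (assumed)."""
    return var_between / flights + var_within / (flights * bombs)

def best_allocation(budget, cost_flight, cost_bomb, var_between, var_within,
                    max_bombs=6, min_flights=2):
    """Search small allocations for the least variance within the budget.

    Returns (variance, flights, bombs per flight, cost), or None if even
    the smallest allocation is unaffordable.
    """
    if cost_flight <= 0 or cost_bomb <= 0:
        raise ValueError("costs must be positive")
    best = None
    flights = min_flights
    while flights * (cost_flight + cost_bomb) <= budget:
        for bombs in range(1, max_bombs + 1):
            cost = flights * (cost_flight + bombs * cost_bomb)
            if cost > budget:
                break
            var = variance_of_mean(flights, bombs, var_between, var_within)
            if best is None or var < best[0]:
                best = (var, flights, bombs, cost)
        flights += 1
    return best

# Hypothetical costs and variances, for illustration only.
RESULT = best_allocation(budget=100.0, cost_flight=10.0, cost_bomb=2.0,
                         var_between=1.0, var_within=4.0)
print(RESULT)
```

When the variance between flights is large relative to that between bombs on
the same flight, a search of this kind favors more flights with fewer bombs
each; when flights are costly and bombs cheap, it favors the reverse.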

3.1.3. Revision Based on New Experience. —Revision of the variances 
and covariance of range and deflection errors may need to be made 
because of changes in techniques of bombing; it should eventually be 
made anyhow in order to have more accurate estimates of these parameters
than are at present available. From a long series of reasonably
homogeneous control-chart data, the variance of range $s_r^2$ may be
estimated as the mean of $s_r^2$ for the recorded flights, and the variance
of deflection $s_d^2$ similarly. There is a need also for an estimate, which
we denote by $s_{rd}$, of the covariance of range and deflection errors.
For a single flight with $N$ bombs such an estimate would be

$$\frac{S(r - \bar r)(d - \bar d)}{N - 1} = \frac{Srd - \bar r\,Sd}{N - 1}$$

but the average of these estimates over the long series should be adopted.
(If $N$ varies among the flights, the ratio of the sum of the numerators to
the sum of the denominators should be used.) It is not necessary
actually to compute these sums of products if all the calculations going
into the control chart have been correct and are available, since the
products have already been computed in the process of obtaining values
of $T_D^2$. Indeed (Sec. 2.2.6),

$$T_{DF}^2 = aS(r - \bar r_F)^2 + 2bS(r - \bar r_F)(d - \bar d_F) + cS(d - \bar d_F)^2$$


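If we sum for $m$ flights, each with $N$ bombs, and use the expressions for
$s_r^2$ and $s_d^2$, we may solve and obtain as our new estimate of
covariance

$$s_{rd} = \frac{1}{2b}\left[\frac{\sum T_{DF}^2}{m(N - 1)} - a\,s_r^2 - c\,s_d^2\right]$$

the summation being over the $m$ flights.
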
The number of degrees of freedom derived from each flight is N — 1, and 
therefore from the m flights in the sample it is m(N — 1). We shall put 
n = m(N — 1). This important number, which must be used in the 
determination of control limits, yardstick lengths, and various tests of 
statistical significance, should be recorded carefully. 
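
The direct route to the covariance estimate, by sums of products, may be
sketched in Python as follows. The flights shown are invented; the function
names are likewise hypothetical. The ratio of the sum of the numerators to
the sum of the denominators is used, which coincides with the simple average
when every flight has the same number of bombs, and the number of degrees of
freedom is returned along with the estimate so that it is not lost.

```python
def flight_cross_products(bombs):
    """Return (S(r - r_bar)(d - d_bar), N - 1) for one flight.

    The numerator is computed as Srd - r_bar * Sd, as in the text.
    """
    n = len(bombs)
    sr = sum(r for r, _ in bombs)
    sd = sum(d for _, d in bombs)
    srd = sum(r * d for r, d in bombs)
    return srd - (sr / n) * sd, n - 1

def pooled_covariance(flights):
    """Return (s_rd, degrees of freedom) pooled over the flights."""
    numerators, denominators = zip(*(flight_cross_products(f) for f in flights))
    dof = sum(denominators)
    return sum(numerators) / dof, dof

# Two invented flights of four bombs each, coordinates in mils.
FLIGHTS = [
    [(2.0, -1.0), (-3.0, 4.0), (1.0, 0.5), (4.0, -2.5)],
    [(0.5, 1.0), (1.5, -2.0), (-1.0, 0.0), (3.0, 2.0)],
]
s_rd, n_dof = pooled_covariance(FLIGHTS)
print("s_rd =", s_rd, " degrees of freedom =", n_dof)
```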

The revised values of the coefficients for $T^2$ are

$$a = \frac{s_d^2}{s_r^2s_d^2 - s_{rd}^2}, \qquad b = \frac{-s_{rd}}{s_r^2s_d^2 - s_{rd}^2}, \qquad c = \frac{s_r^2}{s_r^2s_d^2 - s_{rd}^2}$$

The nature of these formulas will perhaps be clearer by reference to a more
general case. If, instead of range and deflection errors, we were measuring
a considerable number of features of each sight and combining
these into one over-all indicator, we should use for $T^2$ the last formula
on p. 142, with each coefficient $l_{ij}$ equal to the cofactor of the
covariance between the $i$th and $j$th measures in the determinant of such
covariances, divided by this determinant. In the formulas for $a$, $b$, and
$c$ above, the common denominator is the covariance determinant, while the
numerators are the cofactors of the elements of this determinant.*
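
For the two-variable case the rule of cofactors divided by the determinant
may be sketched in Python as follows. The variances and covariance used are
hypothetical, as is the function name. The check at the end multiplies the
covariance matrix by the matrix of coefficients, which should give the unit
matrix if the coefficients have been formed correctly.

```python
def coefficients(s_r2, s_d2, s_rd):
    """Return (a, b, c): cofactors of the 2x2 covariance determinant,
    each divided by the determinant."""
    det = s_r2 * s_d2 - s_rd ** 2
    if det <= 0:
        raise ValueError("covariance determinant must be positive")
    return s_d2 / det, -s_rd / det, s_r2 / det

# Hypothetical estimates: variances in square mils, covariance likewise.
S_R2, S_D2, S_RD = 25.0, 36.0, 3.0
a, b, c = coefficients(S_R2, S_D2, S_RD)

# Product of the covariance matrix with [[a, b], [b, c]].
product = [
    [S_R2 * a + S_RD * b, S_R2 * b + S_RD * c],
    [S_RD * a + S_D2 * b, S_RD * b + S_D2 * c],
]
print(a, b, c)
print(product)
```

With more than two measures the same rule applies, though in practice the
whole matrix of coefficients would be obtained by inverting the covariance
matrix in one operation.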

After two or three years of test bombing in a fairly uniform manner, 
it should be possible to obtain useful measures of seasonal variation to 
be applied as corrections. Even without this delay, it may be possible 
by a special statistical study to work out corrections of the variances to 
allow for the weather. The effect of any such corrections upon the 
number n of degrees of freedom should be noted. 

* The reader unfamiliar with determinants can readily learn enough about them to
carry out these instructions from the chapter on determinants in any book on college
algebra, or from Chap. VII, “Determinants,” in Crum and Schumpeter, Rudimentary
Mathematics. In the determinant of the covariances $s_{ij}$, the elements corresponding
with $i = j$ are simply variances; thus $s_{ii} = s_i^2$.

3.1.4. Assumptions Involved. The control limits on the charts supply
a test of the hypothesis that the sights of a particular lot are as good
as the sights used in the base period. The limits are so determined that, 
if the hypothesis is true, the 0.05 limit will in the long run be exceeded 
approximately once for every 20 lots, and the 0.01 limit approximately
once for every 100 lots. The functions plotted on the control
charts have been chosen in accordance with the findings of statistical 
theory as those most likely to lead to these limits being exceeded if the 
sights tested are in fact worse than those of the base period. 

These statements are, to be sure, based on a number of assumptions, 
but the assumptions appear reasonable. It is assumed that the different 
bomb drops are statistically independent or that corrections are applied 
that take care of any lack of independence. If the randomization procedure
prescribed in Sec. 2 is followed, and especially if suitable corrections
are made for weather, this assumption should fit the facts closely as
regards test bombing done under the plan. Whether true independence
can be ascribed to the actual bombings used in calculating the initial
control limits may be questioned, since no scheme of objective
randomization was in effect at the time and since no corrections were made
for weather apart from the selective effect of there being no bombing in the
worst weather. It is the writer's impression, based on inspection of the 
procedures and records, that lack of independence in these bombings is 
not such as to make any great difference in the control limits. The 
weather factor may be more serious, since the test bombings of this base 
period were all made in November and December. However, the charts 
kept at the proving ground of earlier bombings under different conditions 
failed to disclose any obvious evidence of any marked seasonal variation 
in results. 

It is assumed that the bombing errors in range and deflection have 
a bivariate normal distribution. This assumption has a certain a priori 
justification in the notion of the errors resulting from accumulation of a 
large number of independent errors and has also an approximate a 
posteriori verification in various records of bombing. There is, to be sure, 
a slight skewness in range errors, which can be explained by a small 
proportion of bombs having defects that cause them to fall short with a 
bias peculiar to this small group. However, the deviations from normality
appear to be so small as to have no appreciable influence on the
final results, so far as can be judged from general statistical
considerations. Certainly there is no similarity between the recorded curves of
bomb distribution and the sharp-peaked theoretical ones for which 
mean deviations provide efficient measures of dispersion. 

No assumption is made in this chapter either that correlation of range 
with deflection errors exists or that it does not. The records used in
Sec. 5 indicate that little or no correlation exists, though other records
consisting of compilations of huge quantities of heterogeneous material 
do superficially indicate high correlations. The high correlations in 
these cases can probably be ascribed to the mixing of records of bombings 
at many different places and times, with various kinds of equipment, with 
personnel of widely different degrees of skill, together with faulty records 
whose complete compilation proved impossible. Even if there were a 
substantial correlation, it is not clear that this would be a serious 
detriment if the variances of the errors were moderate. The use of $T^2$ in the 
plan is equally valid whether correlation exists or not. 

In selecting the T functions, it is assumed that no bias in any special 
direction is to be guarded against, but that equal precautions are to 
be taken against biases in all directions. Proofs of the optimum 
properties of T have been given by Hsu and by Wald.* 

It is assumed also that exact probabilities are to be represented 
by the control limits, free from approximations of unknown degrees of 
accuracy. This consideration eliminates some plausible functions, such 
as the mean, or mean square, of the actual distances of the bombs from 
the target. While these functions have an intuitive appeal and seem 
at first glance to represent the real value of the bombsight because 
closeness of bomb to target is the criterion of effectiveness in action, they 
are not suitable for use with proof bombing. They are inefficient 
even for large samples by comparison with $T^2$, and for small samples 
they suffer from the impossibility of determining exact probabilities 
relative to them (see Sec. 4.8). 

3.2. Means 

3.2.1. Control Limits for Means. —The outer control limits for the $\bar r$ 
chart are so chosen that, if the distribution of range errors is the same as 
in the base period, the probability is 0.01 that the mean $\bar r_L$ of the range 
errors found with the sights of any particular lot should be represented by 
a point falling outside these limits. The inner control limits are 
determined in the same way except that the probability is 0.05. The 
heights of these lines in terms of the scale of mils on the chart are given 
by the formula 

$$\bar r' \pm t_\epsilon\, s'_r \sqrt{\frac{1}{n'} + \frac{1}{ghN}}$$

in which the following notation is used: 

$\bar r'$ = mean of range errors in base period. (Errors “over” are 
considered as positive, errors “short” as negative.) 

$s'_r$ = estimated standard deviation of range errors in base period, 
consisting of the square root of the ratio of a sum of squares 
with $n$ degrees of freedom to $n$ 

$n'$ = number of range errors averaged in base period to obtain $\bar r'$ 

$N$ = number of bombs dropped per flight in the period for which 
the control chart is to be used 

$g$ = number of flights per sight in the period for which the control 
chart is used 

$h$ = number of sights per lot in the period for which the control 
chart is used 

$\epsilon$ = 0.01 for outer control limits, 0.05 for inner limits 

$t_\epsilon$ = value of ratio of deviation in a mean to its estimated standard 
error (with $n$ degrees of freedom) exceeded in absolute value with 
probability $\epsilon$ 

* Hsu, “Generalised T”; Wald, “Tests Concerning Several Parameters.” 

With the additional notation 

$\bar d'$ = mean of deflection errors in base period (errors to the right 
considered positive, to the left as negative) 

$s'_d$ = estimated standard deviation of deflection errors in base period 

we have the parallel formula for control limits for $\bar d_L$: 

$$\bar d' \pm t_\epsilon\, s'_d \sqrt{\frac{1}{n'} + \frac{1}{ghN}}$$

The base period considered in Sec. 5 yields the following numerical 
values: 


$\bar r'$ = 1.3450 mils 
$s'_r$ = 6.8040 mils 
$n'$ = 260 


$\bar d'$ = 2.8046 mils 
$s'_d$ = 8.3686 mils 
$n$ = 256 


According to the specifications in Sec. 2.1, g = h = 2 and N = 4. Values 
of $t_\epsilon$ can be found in tables of the Student distribution.* However, 
these tables do not extend to the value n = 256. For such large values 
$t_\epsilon$ may be obtained as the first two or three terms of the asymptotic 
series 

$$t_\epsilon = x\left[1 + \frac{x^2 + 1}{4n} + \frac{(x^2 + 3)(5x^2 + 1)}{96n^2} + \cdots\right]$$

where x is the unit normal deviate exceeded in absolute value with 
probability $\epsilon$.† We thus find for n = 256 

* See Fisher and Yates, Statistical Tables, Table III. This shows $t_\epsilon$ to 3 decimal 
places for $\epsilon$ = 0.001, 0.01, 0.02, 0.05, and 0.1 to 0.9 by steps of 0.1, for n = 1 to 30 by 
steps of 1, 40, 60, 120, and ∞. Also, see Merrington, “Percentage Points of the 
t-Distribution.” This shows $t_\epsilon$ to 5 significant figures for $\epsilon$ = 0.005, 0.01, 0.025, 
0.05, 0.10, 0.25, and 0.50, for the same values of n used by Fisher and Yates. 

† Values of x may be found from Table 1.1 in Chap. 1, provided $\epsilon$/2 is used in 
$t_{0.01}$ = 2.59514        $t_{0.05}$ = 1.96925 

Substituting these values gives control limits for $\bar r_L$ as follows: 

0.01 limits: 1.3450 ± 4.5481 = 5.8931 and -3.2031 

0.05 limits: 1.3450 ± 3.4512 = 4.7962 and -2.1062 

For $\bar d_L$, in the same way, we find 

0.01 limits: 2.8046 ± 5.5940 = 8.3986 and -2.7894 

0.05 limits: 2.8046 ± 4.2448 = 7.0494 and -1.4402 
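
The arithmetic behind these limits can be retraced with a short sketch. It assumes the control-limit formula reads mean ± t s' sqrt(1/n' + 1/(ghN)), that the same n' = 260 and n = 256 apply to range and deflection, and that the Student percent point is taken from the three-term asymptotic series; the function names are illustrative only and are not part of the plan. Agreement with the figures quoted above should be expected only to about three or four decimal places.

```python
from math import sqrt
from statistics import NormalDist


def t_from_normal(eps, n):
    """Two-sided Student percent point from the three-term asymptotic series."""
    # x is the unit normal deviate exceeded in absolute value with probability eps.
    x = NormalDist().inv_cdf(1.0 - eps / 2.0)
    x2 = x * x
    return x * (1.0 + (x2 + 1.0) / (4.0 * n)
                + (x2 + 3.0) * (5.0 * x2 + 1.0) / (96.0 * n * n))


def mean_control_limits(base_mean, base_sd, n_prime, n_df, g, h, N, eps):
    """Return (lower, upper) control limits for the mean of a lot.

    Assumed formula: base_mean +/- t * base_sd * sqrt(1/n' + 1/(g*h*N)).
    """
    half_width = (t_from_normal(eps, n_df) * base_sd
                  * sqrt(1.0 / n_prime + 1.0 / (g * h * N)))
    return base_mean - half_width, base_mean + half_width


# Base-period values quoted in the text, with g = h = 2 and N = 4.
range_limits = {eps: mean_control_limits(1.3450, 6.8040, 260, 256, 2, 2, 4, eps)
                for eps in (0.01, 0.05)}
deflection_limits = {eps: mean_control_limits(2.8046, 8.3686, 260, 256, 2, 2, 4, eps)
                     for eps in (0.01, 0.05)}

for eps in (0.01, 0.05):
    print(eps, range_limits[eps], deflection_limits[eps])
```

Changing n_prime or n_df in the calls shows how weakly the limits depend on the size of the base period once it runs to a few hundred bombs.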

3.2.2. Asymmetry of Control Limits and Bias in Base Period. —It may 
seem strange that the control limits are not symmetrical with respect 
to the zero line on the $\bar r$ and $\bar d$ charts, since sights whose use gives mean 
points of impact close to the target seem more desirable than those 
yielding more remote MPI's. A like question arises in connection with 
$T_M$ and $T_O$. However, it should be emphasized that our comparisons 
are not with a theoretical ideal but with the actual performance of a 
base period in which the testing procedure is presumed to be comparable 
with that in the period for which the control chart is to be used. The 
material on which our records are based shows biases in both range and 
deflection errors in excess of any that could reasonably be ascribed to 
chance. Whether these biases resided in the sights, in the habits of the 
test bombardiers or pilots, in some other feature of the test bombing, 
or in the observation of the results we do not know. The sights used 
were accepted. If the bias was the fault of the sights, this fact should 
be ascertained by an ad hoc inquiry and the manufacturing design changed 
accordingly. If the bias was a result of something in the test bombing 
other than the sights, and if this condition continues in the period for 
which the control charts are to be used, it is necessary that the control 
limits be asymmetrical with respect to the zero line in order to avoid 
wholesale false alarms regarding sights that are really quite satisfactory 
but with which the test bombs fall moderately farther over and to the 
right than the average. If the proof-bombing procedure is such as to 
create the observed bias, a sight should be labeled unsatisfactory much 
more readily for bomb errors to the left and short than for errors of the 
same magnitude in the direction in which the bomb errors are usually 
made. Use of the asymmetrical limits is equivalent to applying a 
correction to each bomb coordinate to compensate for the bias observed in 
the base period. 

entering the table. The value of x for $\epsilon$ = 0.01 is 2.57582930, and for $\epsilon$ = 0.05 it is 
1.95996398. 

The series for $t_\epsilon$ in terms of x is equivalent to a series for x in terms of $t_\epsilon$ given by 
Hotelling and Frankel, “Transformation of Statistics,” and by Goldberg and Levine, 
“t and $\chi^2$”; the first two terms are also given by Peiser, “Significance Levels.” 
If a change in bombing procedures, or in methods of observing the 
positions at which the bombs fall, results in elimination or alteration of 
the observed bias, the control limits should be changed accordingly. 
Until this is done they should be based on experience. 

If an investigation leads to the conclusion that freedom from bias 
has been achieved in the test bombing and is likely to continue, the control 
limits for $\bar r$ and $\bar d$ should be symmetrical about zero, at the same distances 
as from the central lines previously indicated. At the same time, $\bar T_M$ 
should be replaced by $T_M$, and $\bar T_O$ by $T_O$, both as to calculated values and 
in the labeling of the control chart. 

Other considerations involving the fallibility of the base-period sample 
arise in connection with the effect on the distance between upper and 
lower control limits of the number of bombs in the base period. The 
reciprocal of this number n' is one of the terms under the radical sign in 
the formulas for the control limits, so that if the control limits are based 
on a period in which very few bombs were dropped, the limits will be 
relatively far apart. In such a period $t_\epsilon$ will also be slightly larger than 
if numerous bombs were dropped and will push the control limits 
still farther apart. This fact is a consequence of our ability to make 
clear comparisons only with past observations. If the preliminary 
sample with which we compare a new sample is small, the possibility of 
establishing a difference when one really exists is diminished. 

The effect of n' on the control limits is rather slight for numbers of 
any such orders of magnitude as we have been considering. There 
seems no reason here why the control limits should not be revised after 
every 200 or 300 bombs, if it is thought that fundamental conditions 
have changed in such an interval. A more serious objection to such 
frequent revision for the present is the lack of exact probabilities when n 
is small. 

3.2.3. Yardsticks for Differences of Means. —Unlike the control 
limits, these yardsticks are unaffected by the bias in the MPI's of the base 
period. The lengths of the yardsticks for means of flights, sights, and 
lots are, respectively, 

$$\sqrt{\frac{2}{N}}\, s'\, t_{0.05}, \qquad \sqrt{\frac{2}{gN}}\, s'\, t_{0.05}, \qquad \sqrt{\frac{2}{ghN}}\, s'\, t_{0.05},$$

where s' is to be replaced by $s'_r$ for the range yardsticks and by $s'_d$ for the 
deflection yardsticks, and the value of $t_{0.05}$ is to be determined with 
reference to the value of n underlying the estimates of variance in the 
base period. For g = h = 2, N = 4, and the same values of $s'_r$ and $s'_d$ 
used previously on the basis of experience, these expressions take the 
following values, which are the lengths of the yardsticks shown in Figs. 
3.1 and 3.2: 
              Flights    Sights      Lots 

Range          9.4741    6.6992    4.7370 

Deflection    11.6527    8.2397    5.8263 
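
As a check on the tabulated lengths, the following sketch assumes that the yardstick for the difference of two means, each of m observations, is sqrt(2/m) s' t, with m = N, gN, and ghN for flights, sights, and lots. That reading is an assumption made here; it reproduces the table to about three decimal places. The value of t is the 5 percent point quoted in Sec. 3.2.1, and the names are illustrative.

```python
from math import sqrt

T_05 = 1.96925          # 5 percent point of t for n = 256, as quoted in Sec. 3.2.1
S_RANGE = 6.8040        # base-period standard deviation of range errors, mils
S_DEFLECTION = 8.3686   # base-period standard deviation of deflection errors, mils


def yardstick(sd, observations_per_mean, t=T_05):
    """Assumed yardstick length for the difference of two means of m bombs each."""
    return sqrt(2.0 / observations_per_mean) * sd * t


g, h, N = 2, 2, 4
bombs_per_mean = {"flights": N, "sights": g * N, "lots": g * h * N}

yardsticks = {
    name: {level: yardstick(sd, m) for level, m in bombs_per_mean.items()}
    for name, sd in (("range", S_RANGE), ("deflection", S_DEFLECTION))
}

for name, row in yardsticks.items():
    print(name, {level: round(length, 3) for level, length in row.items()})
```

Because only s' and t enter, the base-period means play no part, which is the sense in which the yardsticks are unaffected by bias.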



3.3. Variances 

3.3.1. Control Limits for Variances. —The control limits on the charts 
for $s_r^2$ and $s_d^2$ are fixed in accordance with the distribution of the ratio of 
two independent estimates of the same variance. This ratio is usually 
denoted by F, since R. A. Fisher was the first to work with the 
distribution and to show its extensive usefulness in statistical work of many 
kinds.* 

To obtain the 0.01 control limit for $s_r^2$, we put $s_r^2/s_r'^2 = F$, and obtain 
from the table of 1 percent points the value of F corresponding to 

$$n_1 = gh(N - 1) \quad \text{and} \quad n_2 = n$$

From this value of F and the value $s_r'^2$ found from the base period we 
obtain $s_r^2$, the upper control limit. According to the plan specified in Sec. 
2.1, g = h = 2 and N = 4, so that $n_1$ = 12. For the base period of 
Sec. 5, n = 256 and $s_r'^2$ = 46.295. From the tables it is found that, for 
$n_1$ = 12 and $n_2$ = 256, the 1 percent point of F is approximately 2.2544. 
Multiplying this by 46.295 gives the value 104.4 as the 1 percent control 
limit for $s_r^2$. 

The numerical values for the control limits on the $s^2$ scale of Figs. 
3.3 and 3.4 are found in this way and given below: 

              0.01 limit    0.05 limit 

Range            104.4          82.9 

Deflection       157.9         125.4 

These control limits take full account of the fact that the variances 
derived from the preliminary sample are not exactly those of the popula¬ 
tion but are based on 256 degrees of freedom instead of an infinite number. 
If the same variance estimates had been obtained from a preliminary 
sample of, say, 1,000,000 degrees of freedom, the control limits for range 
variance would have been reduced to 101.2 and 81.1, with proportionate 
reductions in the case of deflection. 

* For a table of F, see Fisher and Yates, Statistical Tables, Table V, where F is 
called “the variance ratio.” This table contains a footnote which, since it may be 
misinterpreted, should be disregarded in following the instructions in this chapter. 
A more extensive table of F is given by Merrington and Thompson, “Percentage 
Points of the Inverted Beta (F) Distribution.” 
3.3.2. Yardsticks for Variances.—The yardsticks on the variance 
charts have no relation whatever to the base period but depend only on 
the numbers of degrees of freedom in the current variance estimates 
that are to be compared. Their lengths are the same for the charts of 
range variance and of deflection variance. The vertical distance between 
two plotted points has probability 0.05 of exceeding the length of the 
appropriate yardstick if the variances plotted are obtained from samples 
from the same hypothetical infinite population. If F is the ratio of two 
such sample variances with equal numbers of degrees of freedom, log F 
has a symmetrical distribution extending from minus to plus infinity. 
The value of log F corresponding to the length of the yardstick is such 
that its probability of being exceeded in absolute value is 0.05. Hence, 
the probability of a random value being actually greater than this is 
0.025, and the probability of its being less than the negative of this 
yardstick value is also 0.025. The yardstick length would thus correspond 
to the 2½ percent point of the F distribution.* 

For each flight the number of degrees of freedom for the differences 
among the range coordinates of the N bombs is N — 1; this is equal to 
the number of degrees of freedom of the deviations of these N numbers 
from their mean. The variance estimates for sights and for lots are 
obtained by adding the sums of squares of deviations from the respective 
flight means and dividing by the sum of the numbers of degrees of 
freedom. Thus the number of degrees of freedom in the variance 
estimate for a sight is g(N - 1), while for a lot it is gh(N - 1). 

For the specified values g = h = 2, N = 4 the numbers of degrees 
of freedom for flights, sights, and lots, respectively, are therefore 3, 6, 
and 12. The critical values of F in these three cases are 15.439, 5.8197, 
and 3.2773. On the logarithmic scale of $s^2$ at the left of Fig. 3.3 the 
heights from 1 to these three numbers are the lengths of the yardsticks. 
On Fig. 3.4 the lengths are the same. 
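
Where tables of the 2½ percent points are not at hand, the critical values can be approximated numerically. The sketch below is not the method by which the published tables were computed: it evaluates the tail area of F through the regularized incomplete beta function (a standard continued-fraction evaluation) and inverts it by bisection. All function names are illustrative.

```python
from math import exp, lgamma, log


def _beta_cf(a, b, x, max_iter=500, tol=1e-14):
    """Continued fraction for the incomplete beta function (modified Lentz method)."""
    tiny = 1e-300
    c = 1.0
    d = 1.0 - (a + b) * x / (a + 1.0)
    d = 1.0 / (d if abs(d) > tiny else tiny)
    h = d
    for m in range(1, max_iter + 1):
        numerators = (
            m * (b - m) * x / ((a + 2 * m - 1.0) * (a + 2 * m)),
            -(a + m) * (a + b + m) * x / ((a + 2 * m) * (a + 2 * m + 1.0)),
        )
        for num in numerators:
            d = 1.0 + num * d
            d = 1.0 / (d if abs(d) > tiny else tiny)
            c = 1.0 + num / c
            c = c if abs(c) > tiny else tiny
            step = d * c
            h *= step
        if abs(step - 1.0) < tol:
            break
    return h


def reg_inc_beta(a, b, x):
    """Regularized incomplete beta function I_x(a, b)."""
    if x <= 0.0:
        return 0.0
    if x >= 1.0:
        return 1.0
    front = exp(lgamma(a + b) - lgamma(a) - lgamma(b)
                + a * log(x) + b * log(1.0 - x))
    if x < (a + 1.0) / (a + b + 2.0):
        return front * _beta_cf(a, b, x) / a
    return 1.0 - front * _beta_cf(b, a, 1.0 - x) / b


def f_sf(f, n1, n2):
    """Probability that a variance ratio with (n1, n2) degrees of freedom exceeds f."""
    return reg_inc_beta(n2 / 2.0, n1 / 2.0, n2 / (n2 + n1 * f))


def f_upper_point(prob, n1, n2):
    """Value of F exceeded with probability prob, found by bisection."""
    lo, hi = 0.0, 1.0
    while f_sf(hi, n1, n2) > prob:
        hi *= 2.0
    for _ in range(100):
        mid = 0.5 * (lo + hi)
        if f_sf(mid, n1, n2) > prob:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


# Yardsticks: 2.5 percent points with equal degrees of freedom 3, 6, and 12.
variance_yardsticks = {df: f_upper_point(0.025, df, df) for df in (3, 6, 12)}
print({df: round(f, 3) for df, f in variance_yardsticks.items()})

# The same routine gives the 1 percent point used in Sec. 3.3.1.
print(round(f_upper_point(0.01, 12, 256), 3))
```

On the logarithmic scale of the charts the yardstick length is the logarithm of each critical value, so log(variance_yardsticks[df]) is the quantity actually laid off.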

3.4. Comprehensive Measures 

3.4.1. Control Limits for the T's. —When the variance estimates for 
the base period utilize a large number n of degrees of freedom, such as 
the value 256 here used, the distribution of $T^2$ is approximately the same 
as the $\chi^2$ distribution that has been tabulated and widely used for a 
variety of statistical purposes.† This distribution depends on a number 
of degrees of freedom f, which may be defined in the following manner. 
For each bomb the measure of discrepancy from the target 

* The 2½ percent points of F are not given by Fisher and Yates but are given by 
Merrington and Thompson. 

† For tables of the $\chi^2$ distribution, see Fisher and Yates, Statistical Tables, Table 
IV; or Thompson, “Percentage Points of the $\chi^2$ Distribution.” 
$$T_{OB}^2 = ar^2 + 2brd + cd^2$$

has (approximately for large n) the $\chi^2$ distribution with 2 degrees of 
freedom, provided the bomb is dropped under essentially the same 
conditions as the bombs in the base period, and provided the probability 
distribution is actually centered at the target without bias. The sum of 
any number of independent values of $\chi^2$ has the $\chi^2$ distribution with a 
number of degrees of freedom equal to the sum of the numbers for the 
several values of $\chi^2$ added. Thus $T_{OF}^2$, the sum of the N values of $T_{OB}^2$ for 
the N bombs dropped in a flight, has approximately the $\chi^2$ distribution 
with 2N degrees of freedom. Since g flights are made with each sight, 
$T_{OS}^2$ has 2gN degrees of freedom, and since h sights of each lot are 
tested, $T_{OL}^2$ has 2ghN degrees of freedom. Since in the specific plan of 
Sec. 2.1, g = h = 2 and N = 4, control limits for $T_{OL}^2$ may be obtained 
approximately from the 0.01 and 0.05 probability levels for $\chi^2$ with 
32 degrees of freedom. 

Under this same condition of absence of bias, $\bar r_F\sqrt{N}$ and $\bar d_F\sqrt{N}$ 
have the same joint distribution as r and d. Consequently 

$$T_{MF}^2 = N(a\bar r_F^2 + 2b\bar r_F\bar d_F + c\bar d_F^2)$$

has, with the same approximation as before, the $\chi^2$ distribution with 2 
degrees of freedom. For the gh flights involved in testing a sample lot, 
the sum $T_{ML}^2$ of these quantities has approximately the $\chi^2$ distribution 
with 2gh degrees of freedom. For the plan of Sec. 2.1 this means 8 degrees 
of freedom for $T_{ML}^2$. 

The distribution of a sum of squares of deviations from a mean is 
independent of the distribution of the mean itself when, as is assumed 
throughout, the fundamental distribution is of the normal form. In like 
manner the sum of the products of the deviations of r and d from their 
respective means is independent of these means in the probability sense. 
It follows that 

$$T_{DF}^2 = aS(r - \bar r_F)^2 + 2bS(r - \bar r_F)(d - \bar d_F) + cS(d - \bar d_F)^2$$

is independent of $T_{MF}^2$. Since 

$$T_{DF}^2 + T_{MF}^2 = T_{OF}^2$$

and since $T_{MF}^2$ has 2 degrees of freedom while $T_{OF}^2$ has 2N, it follows that 
$T_{DF}^2$ has approximately the $\chi^2$ distribution with 2(N - 1) degrees of 
freedom. Since $T_{DL}^2$ is the sum of gh independent values of $T_{DF}^2$, the 
number of degrees of freedom for $T_{DL}^2$ is 2gh(N - 1), a number which for 
the plan of Sec. 2.1 equals 24. 
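
The decomposition of the over-all measure for a flight into a part due to the flight mean and a part due to dispersion about that mean can be checked numerically. In the sketch below the coefficients a, b, c and the bomb errors are hypothetical values chosen only for illustration, and the function names are not part of the plan.

```python
def quad(a, b, c, r, d):
    """Quadratic form a*r^2 + 2*b*r*d + c*d^2."""
    return a * r * r + 2.0 * b * r * d + c * d * d


def flight_statistics(a, b, c, errors):
    """Return (T2_OF, T2_MF, T2_DF) for one flight.

    errors is a list of (range error, deflection error) pairs, one per bomb.
    """
    n_bombs = len(errors)
    r_bar = sum(r for r, _ in errors) / n_bombs
    d_bar = sum(d for _, d in errors) / n_bombs
    t2_of = sum(quad(a, b, c, r, d) for r, d in errors)
    t2_mf = n_bombs * quad(a, b, c, r_bar, d_bar)
    t2_df = sum(quad(a, b, c, r - r_bar, d - d_bar) for r, d in errors)
    return t2_of, t2_mf, t2_df


def degrees_of_freedom(g, h, N):
    """Degrees of freedom of the lot statistics for means, dispersion, and over-all."""
    return {"T_ML": 2 * g * h, "T_DL": 2 * g * h * (N - 1), "T_OL": 2 * g * h * N}


# Hypothetical coefficients and bomb errors (mils), for illustration only.
a, b, c = 0.0216, 0.0010, 0.0143
flight = [(3.0, -5.0), (-7.5, 2.0), (10.0, 12.5), (1.5, -0.5)]

t2_of, t2_mf, t2_df = flight_statistics(a, b, c, flight)
print(t2_of, t2_mf + t2_df)          # the two numbers agree up to rounding
print(degrees_of_freedom(2, 2, 4))   # 8, 24, and 32 for the plan of Sec. 2.1
```

The degrees of freedom add in the same way as the statistics: 2 for the mean part and 2(N - 1) for the dispersion part make up the 2N of the over-all measure.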

It is assumed in the foregoing paragraphs that r and d are free from 
bias. If there is a bias, the same in the new as in the old bombings, 
$r - \bar r'$ and $d - \bar d'$ will be free of bias but will have variances 1 + 1/n' 
times those of r and d, respectively. (Here n' is the number of bombs in 
the old sample, equal to 260 for the base period considered.) Hence, if 
we replace r and d, respectively, by 

$$\frac{r - \bar r'}{\sqrt{1 + \frac{1}{n'}}} \quad \text{and} \quad \frac{d - \bar d'}{\sqrt{1 + \frac{1}{n'}}}$$

in $T_{OB}^2$, the resulting statistic 

$$\bar T_{OB}^2 = \frac{a(r - \bar r')^2 + 2b(r - \bar r')(d - \bar d') + c(d - \bar d')^2}{1 + \frac{1}{n'}}$$

has the distribution that $T_{OB}^2$ would have in the unbiased case. The sum 
$\bar T_{OL}^2$ of Ngh values of $\bar T_{OB}^2$ therefore has the same approximate $\chi^2$ 
distribution with 2Ngh degrees of freedom as $T_{OL}^2$. 

The mean range $\bar r_F$ for the N bombs of a flight differs from the mean 
range $\bar r'$ for the n' bombs of the base period by an amount whose variance 
is 1/N + 1/n' times the variance of range in general. The like is true 
of the difference $\bar d_F - \bar d'$. Consequently 

$$\bar T_{MF}^2 = \frac{a(\bar r_F - \bar r')^2 + 2b(\bar r_F - \bar r')(\bar d_F - \bar d') + c(\bar d_F - \bar d')^2}{\frac{1}{N} + \frac{1}{n'}}$$


has the same approximate $\chi^2$ distribution as $T_{MF}^2$, with 2 degrees of 
freedom; the sum of gh such values for a lot accordingly has approximately 
2gh degrees of freedom. 
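
The two corrections for a biased base period amount to measuring each error from the base-period mean and dividing by the appropriate variance factor. A minimal sketch follows; the coefficients and bomb errors are hypothetical, only the base-period means and n' = 260 are taken from the text, and the function names are illustrative.

```python
def quad(a, b, c, r, d):
    """Quadratic form a*r^2 + 2*b*r*d + c*d^2."""
    return a * r * r + 2.0 * b * r * d + c * d * d


def corrected_bomb_statistic(a, b, c, r, d, r_base, d_base, n_prime):
    """Single-bomb measure taken about the base-period means, divided by 1 + 1/n'."""
    return quad(a, b, c, r - r_base, d - d_base) / (1.0 + 1.0 / n_prime)


def corrected_flight_mean_statistic(a, b, c, errors, r_base, d_base, n_prime):
    """Flight-mean measure taken about the base-period means, divided by 1/N + 1/n'."""
    n_bombs = len(errors)
    r_bar = sum(r for r, _ in errors) / n_bombs
    d_bar = sum(d for _, d in errors) / n_bombs
    return (quad(a, b, c, r_bar - r_base, d_bar - d_base)
            / (1.0 / n_bombs + 1.0 / n_prime))


# Hypothetical coefficients and bomb errors; base means and n' as quoted in the text.
a, b, c = 0.0216, 0.0010, 0.0143
flight = [(3.0, -5.0), (-7.5, 2.0), (10.0, 12.5), (1.5, -0.5)]
r_base, d_base, n_prime = 1.3450, 2.8046, 260

per_bomb = [corrected_bomb_statistic(a, b, c, r, d, r_base, d_base, n_prime)
            for r, d in flight]
print(sum(per_bomb))
print(corrected_flight_mean_statistic(a, b, c, flight, r_base, d_base, n_prime))
```

With zero base-period means and a very large n', both functions reduce to the uncorrected forms, which is a convenient check on any implementation.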

These approximate distributions require correction for the limited 
number n of degrees of freedom of the sample in the base period. If a 
base period with a value of n less than, say, 100 were taken, the correction 
would be substantial. As the value of n increases, the exact $\chi^2$ 
distribution is approached as a limit. 

No study of the exact distributions of the various T statistics here 
used has been published except for the simplest case. The probability 
that $T_{OB}$ should exceed any assigned value T is known to be* 

This is the special case for p = 2 of the distribution of the generalized 
Student ratio given by 

$$T^2 = \sum_i \sum_j l_{ij} x_i x_j$$

where $l_{ij}$ is the element in the ith row and jth column of a p-rowed 
symmetric matrix inverse to the matrix of sample covariances between $x_i$ 
and $x_j$ found in a former sample with n degrees of freedom for each of 
the p variates. The element of probability for this generalized Student 
ratio, namely, 

* See Hotelling, “Generalization of Student's Ratio,” formula (26). 
approaches that for $\chi$ with p degrees of freedom when n increases. The 
probability integral of this distribution may be evaluated in various ways, 
one of which is to use the asymptotic expansion* 

$$\chi^2 \sim T^2\left[1 - \frac{p + T^2}{2n} + \frac{4 - p^2 + (2 + 5p)T^2 + 8T^4}{24n^2} + \cdots\right]$$

in conjunction with the $\chi^2$ distribution. From it may be obtained the 
asymptotic expansion for $T^2$ in terms of $\chi^2$, namely, 

$$T^2 \sim \chi^2\left[1 + \frac{p + \chi^2}{2n} + \frac{7p^2 - 4 + (13p - 2)\chi^2 + 4\chi^4}{24n^2} + \cdots\right]$$

This formula is particularly suitable for determining the values of $T^2$ 
corresponding to any particular probability level from those already 
tabulated for $\chi^2$. 

Probabilities connected with the T distribution, or values which will 
be exceeded with assigned probabilities, can also be evaluated by means 
of the transformation 

$$T^2 = \frac{npF}{n - p + 1}$$

where F has the variance ratio distribution, of which tables are available, 
with $n_1 = p$ and $n_2 = n - p + 1$ degrees of freedom. This method is 
particularly suitable for small values of n. 
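
For p = 2 the two routes to a percent point of $T^2$ can be compared directly, since both the variance ratio with 2 degrees of freedom in the numerator and $\chi^2$ with 2 degrees of freedom have closed-form tail areas: the former is (1 + 2F/m)^(-m/2) for m degrees of freedom in the denominator, the latter exp(-χ²/2). The sketch below relies on these two standard results and on the expansion of $T^2$ in terms of $\chi^2$ with the coefficient of $\chi^2$ in the second-order term read as 13p - 2, which is an assumption made here. Function names are illustrative.

```python
from math import log


def t2_from_f(f_value, n, p):
    """Convert a variance-ratio percent point to T^2 = n p F / (n - p + 1)."""
    return n * p * f_value / (n - p + 1)


def f_upper_point_2_m(eps, m):
    """Upper eps point of F with (2, m) degrees of freedom, in closed form."""
    # For n1 = 2 the tail area is (1 + 2F/m)^(-m/2), which inverts directly.
    return 0.5 * m * (eps ** (-2.0 / m) - 1.0)


def t2_asymptotic(chi2, n, p):
    """T^2 percent point from the chi-square point (assumed form of the expansion)."""
    second_order = 7.0 * p * p - 4.0 + (13.0 * p - 2.0) * chi2 + 4.0 * chi2 ** 2
    return chi2 * (1.0 + (p + chi2) / (2.0 * n) + second_order / (24.0 * n * n))


def compare(eps, n, p=2):
    """Return (exact, approximate) upper eps points of T^2 for p = 2."""
    chi2 = -2.0 * log(eps)   # upper eps point of chi-square with 2 degrees of freedom
    exact = t2_from_f(f_upper_point_2_m(eps, n - p + 1), n, p)
    return exact, t2_asymptotic(chi2, n, p)


for eps in (0.01, 0.05):
    exact, approx = compare(eps, 256)
    print(eps, round(exact, 4), round(approx, 4))
```

For n = 256 the two values differ only in the fourth decimal place; for small n the gap widens, which is where the transformation to F is preferable.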

This known T distribution may be used temporarily as an 
approximation for the distributions of the T statistics used in the plan 
presented in this chapter. It has been so used in obtaining the control 
limits listed in Sec. 2.2.6, which are shown in Figs. 3.5, 3.6, and 3.7. For 
this approximation the p of the foregoing formulas is replaced by the number 
of degrees of freedom f, which in Sec. 2 has the values 8, 24, and 32 for 
$T_{ML}$, $T_{DL}$, and $T_{OL}$, respectively. In general, f equals 2gh, 2gh(N - 1), 
and 2ghN for these several cases.† 

* See Hotelling and Frankel, “Transformation of Statistics,” p. 96. 

† An article on the mathematical theory of this chapter, which is expected to 
appear in the Annals of Mathematical Statistics, will include the exact distribution of 
these T's. 





3.4.2. Yardsticks for the T's. —The values of $T^2$ obtained from 
different flights are not exactly independent in the probability sense, since 
they involve the same values of a, b, and c, which are subject to the 
sampling errors of the base period, which will be small if n is large. 
For this reason it appears that the several values of T 2 will have positive 
correlations, but that these will be very small when n is as large as in the 
sample here considered. Apparently no great error will be made by 
assuming independence. 

Pending the development of a more exact treatment, we may obtain 
yardsticks for the T charts by assuming independence and attributing to 
each T the $\chi$ distribution. The ratio of two independent values of $T^2$, 
each with f degrees of freedom, has thus approximately the distribution 
of the ratio of two independent values of $\chi^2$, that is, F, with $n_1 = n_2 = f$. 
Since yardsticks are desired with a probability 0.05 of being exceeded 
in length on the logarithmic scale, the tables of the 2½ percent points 
of F may be used. The values of f for single flights are 2N, 2, and 
2(N - 1) for $T_O$, $T_M$, and $T_D$, respectively. These values of f are to be 
multiplied by g for sights and by gh for lots. 

4. BACKGROUND AND REASONS FOR THE PLAN 
4.1. Objectives of Sampling Inspection 

Apart from special experimental and exploratory purposes, two 
principal objects can be gained efficiently by means of routine inspection 
and testing of small samples, and a third object can at the same time be 
attained in a somewhat less adequate fashion. From small samples 
it is possible 

(1) To judge efficiently the general level of quality of the product of 
a factory or a part of a factory over a period, in comparison with the 
output of other productive units, and of the same unit at other times. 

(2) To detect promptly irregularities, trends in quality, and 
deviations from customary levels. 

(3) To select and reject individual bad articles, or bad lots. This 
function is often considered the primary one. However, the efficiency 
of small samples for this purpose, when the proportion of defectives 
that can be tolerated is small, is not so satisfactory as is sometimes 
imagined. This has been brought out in a recent article, “Rational 
Sentencing,” by Dr. E. H. Sealy. 

On the basis of the small samples (5 to 10 per lot of 2,000) which it is possible 
to fire at ammunition proof, it is quite impossible to distinguish between the 
qualities of individual lots, or to sort out individual bad lots from the bulk in 
any but the grossest cases. Proof specifications of the type which state that 
a lot will be accepted if its proof is clear and rejected or reproved if certain defects 
appear at proof have very little effect upon the quality of the bulk being inspected. 
This has often been demonstrated in cases where the test is nondestructive by 
100 percent inspecting the bulk both before and after proof. The percentages 
defective found at 100 percent inspection of the accepted and rejected lots are 
almost invariably found to be nearly equal and both very little different from the 
percentage of defectives in the original bulk presented for proof. In other words, 
the quality of the product passed into the service under such a sentencing system 
is practically that which happens to be made at the factory. . . . 

It is, then, quite impracticable to attempt to discriminate between individual 
lots by means of a small sample proof. What we can get, however, from such a 
proof is a knowledge of the average, over-all quality of the product of any given 
factory, for the accumulation of many small sample proofs amounts in time to 
bulk of evidence which there is no gainsaying. . . . 

Broadly speaking, then, the object of rational sentencing is not to sort out 
individual good and bad lots but to estimate qualities assumed to be reasonably 
constant. If we estimate that, over a long period, a firm produces consistently 
a product no better than 3 percent defective, then if we need this firm's 
production, we must perforce accept it as a whole along with its 3 percent of defectives, 
for this is the best this firm can do under its present state of knowledge and 
equipment. In such a case there will be, on the average, about 60 defectives in each 
lot of 2,000, and clearly, by chance, when picking a sample at random from a lot, 
one or more of these 60 defectives will from time to time be found. But it is 
irrational to suppose that the lots, none of whose 60 defectives chanced to be 
chosen in the proof sample, are any better than the others. 
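
The force of this argument can be seen from the hypergeometric probability that a random proof sample contains no defective at all. The sketch below uses the figures of the quotation (lots of 2,000 containing 60 defectives, samples of 5 or 10); the function name is illustrative.

```python
from math import comb


def prob_clear_proof(lot_size, defectives, sample_size):
    """Probability that a random sample without replacement contains no defective."""
    return comb(lot_size - defectives, sample_size) / comb(lot_size, sample_size)


for sample_size in (5, 10):
    p_clear = prob_clear_proof(2000, 60, sample_size)
    print(sample_size, round(p_clear, 3), round(1.0 - p_clear, 3))
```

Under these figures most lots of identical quality pass with a clear proof, and the minority in which a defective happens to be drawn are no worse than the rest.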

Dr. Sealy has elsewhere* shown the great difficulty of devising a 
practical small-sample scheme that will reduce to reasonable 
dimensions both the producer’s risk of having good lots rejected and the 
consumer’s risk of having bad lots accepted, when the definition of 
goodness is that the proportion defective in a lot shall be not more than about 
2 percent. His work is, however, confined to cases in which the result 
of the inspection of each article is merely to classify it as defective or 
passable. The crudeness of this type of inspection in comparison with 
quantitative measurements is partly responsible for the difficulties he 
emphasizes. With the relatively refined statistical methods 
recommended in this chapter for quantitative measures of bombsight quality, 
the chances of detecting a bad lot are much greater than by the simple 
procedures whose limitations he points out. The fact remains, however, 
that too much should not be expected of small samples from the 
standpoint of catching infrequent defects. 

One way to meet this difficulty is to increase the size of the sample 
by lumping together successive lots, while keeping the same proportion 
of inspection, so as to have larger samples. This procedure, however, 
introduces new and formidable difficulties. It is objectionable to keep 
lots of thousands on hand for the long periods required for selection and 


* “ Acceptance Sampling.” 
testing of samples; and, more fundamentally, production conditions are 
likely to change considerably during the accumulation and testing 
processes for such samples, so that the homogeneity necessary for accurate 
inferences is lost. 

4.2. Reasons for Random Selection and for Lot Size 20 

It is evident that the detection of individual defective bombsights, 
or small lots of bombsights, needs to be done primarily by tests in the shop 
rather than in the air. The main reliance for this purpose must be on the 
painstaking, detailed inspection of every sight made by the government 
inspectors at the factory. The inspection at the proving ground of 10 
percent of the sights can be only a partial check on the inspectors at 
the factories, incapable of reliability in detection of bad small lots or 
individual sights in any but the grossest cases. 

The real functions of testing and inspection at the proving ground 
must be taken primarily to be the two set forth at the beginning of the 
preceding section, namely, (1) the continuing appraisal of general level 
of quality, and (2) the detection of trends, excessive irregularity, and 
differences between sights produced at about the same time in the same 
factory. 

To carry out these functions it is essential to have at the proving 
ground sights chosen at random. Selection of those apparently the 
worst, as has been done in the past, introduces not only a bias, which 
could be compensated by calculation after the accumulation of experience 
with sufficient numbers chosen at random, but the much more serious 
difficulty of excessive variation. To carry out accurately the two 
leading functions of sampling inspection on the basis of samples thus 
selected is practically impossible. The worst, or the least, or the greatest, 
with respect to any characteristic, in samples of 10 or more has a distribu¬ 
tion far removed from that of the general population from which the 
samples were drawn, not only in average level, but also in variability 
from sample to sample and in the form of the distribution. It is these 
considerations that lead to the recommendation to select 10 percent of 
the output at random, in place of taking the worst 1 in 10. 

We do not recommend taking 1 in each successive lot of 10 at random, 
but rather taking 2 at random from each of successive lots of 20. This 
might be replaced by taking 3 at random from each of successive lots of 
30, or similarly with larger numbers. However, the use of these larger 
lots would mean delay while they were being accumulated and would 
increase the trouble with fractions of lots remaining as tag ends with every 
change of model. An even more cogent objection to large lots is that 
the larger the lot the greater is the chance of some change in the 
conditions of production while it is being accumulated. Such a change means 
heterogeneity within the lot, which weakens the inferences from it, and 
means that the existence of the change is not likely to be detected so 
soon as if the lots were smaller. It is particularly important that 
deteriorations in quality be discovered promptly, in order that further 
defective sights may be prevented before a great number of them 
accumulates. These considerations lead to a choice of size of lot as small as is 
consistent with having a plurality of individual sights chosen at random 
from it. 

The use of 10 percent as the proportion of sights to be sent to the 
proving ground is neither defended nor opposed here but is adopted as a 
basis of calculation because this proportion has been used in the past. 

4.3. Relation to Experimental Design 

One reason for using at least two sights chosen at random from 
each lot is in order to obtain a ready measure of variation within the lot, 
so that excessive differences can be observed with the help of the 
yardsticks provided for the purpose and attention can thus be directed to 
possible heterogeneity in the productive process that might lead to 
deterioration. Another reason is the desirability of averaging together 
results from different sights of a lot in order to eliminate some of the 
chance variation and make trends and excessive errors more clearly 
discernible. 

These two reasons are features of the modern theory of the design 
of experiments, which rests on the twin principles of replication and 
randomization.* Replication (that is, repetition) has two values. One 
of these is the increased accuracy obtainable by averaging the results of 
several observations. The other, which is of the greater importance 
though perhaps less obvious, is the provision of data for estimating the 
variance due to errors of observation, data consisting essentially of 
the differences between the results obtained from experiments in which 
the controlled conditions are the same. Knowledge concerning this
error variance is necessary before any conclusion can be drawn, since 
real effects can only be distinguished from chance when information is 
available as to the distribution of the chance factor. It is true that 
estimates of chance variance can sometimes be drawn from other analo¬ 
gous experiments or from general knowledge; but such estimates are of 
doubtful validity when applied to a new set of data that must inevitably 
differ somewhat in origin from those provided by older experiments. 
The most satisfactory estimate of variance is that provided by the self-
contained experiment, which is arranged from the beginning so as to
yield such an estimate at the same time as the measures of the main
effects to be investigated.

* See Fisher, Design of Experiments, especially Secs. 9 and 10 of Chap. 2, Sec. 20
of Chap. 3, and Secs. 25 and 26 of Chap. 4.

Randomization is necessary in order to make the results valid, in 
the sense that the calculated estimates are free from bias. When the 
selection of cases for particular kinds of treatment is made by any human 
being, and especially by a person familiar with the things selected, there 
is grave danger of bias, conscious or unconscious, in the assignments. 
Zeal and conscientious care in the selection are no safeguards against 
bias, as has been demonstrated repeatedly in a variety of fields. The 
principal reason for this is that attempts to get a “representative”
sample or set of conditions by conscious choice usually result in excessive 
variance within the group in a way that cannot be taken account of by 
any possible calculation when the probabilities necessary for testing the 
significance of the results are sought. Such difficulties can be overcome 
only through the actual utilization of the mechanisms of games of chance 
or their equivalents. The general framework of the experiment should 
be established by careful design; but within this framework the allocation 
of the particular treatments to individual cases must be strictly at 
random. 

When applied to the sampling inspection of articles coming suc¬ 
cessively from a production line, the principles of replication and randomi¬ 
zation suggest the following procedure: If 10 percent, for example, of 
the product is to be subjected to a specially intensive testing, with a 
fixed number chosen from each successive batch of some chosen size, 
these principles call for the selection at random, within each batch, of at 
least two articles. Thus, samples of 10 percent taken in accordance
with these principles require batches of at least 20 articles. 
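The selection rule just described can be sketched in a few lines. This is only an illustration under stated assumptions: the serial numbers, the function name `sample_batches`, and the choice to skip an incomplete final batch are hypothetical and are not part of the procedure as specified in the text.

```python
import random

def sample_batches(items, batch_size=20, per_batch=2, seed=None):
    """Pick `per_batch` items at random from each complete batch of `batch_size`."""
    rng = random.Random(seed)
    chosen = []
    # Assumption: an incomplete tag end at the end of a run is not sampled here.
    full = len(items) - len(items) % batch_size
    for start in range(0, full, batch_size):
        batch = items[start:start + batch_size]
        chosen.append(rng.sample(batch, per_batch))  # random, without replacement
    return chosen

serials = list(range(1, 101))            # hypothetical serial numbers 1..100
picks = sample_batches(serials, seed=1)  # five batches, two articles from each
for number, pair in enumerate(picks, start=1):
    print(f"batch {number}: test articles {sorted(pair)}")
```

Because each batch contributes two articles, the difference between the two results gives a running measure of variation within the batch, which is the point of the replication requirement.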

A seemingly more natural and perhaps a more usual practice when a 
10 percent sample is to be taken from a production line is to take every 
tenth article. If there should be a cycle of period 10 in any part of the 
productive process, this practice would introduce a biased error into the 
averages. It also fails to provide a continuing estimation of variance and 
therefore encourages a reliance on the variance obtained from data 
available when the system was inaugurated, observations which may 
in some cases be unfortunately few in number, and which always refer 
to a past time and are therefore more or less obsolescent. 
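The bias from a cycle of period 10 can be illustrated with a small, entirely hypothetical example. The sinusoidal error pattern, the run length of 1,000 articles, and all names below are assumptions chosen only to make the effect visible; they do not describe any actual production record.

```python
import math
import random

def error(i):
    """Hypothetical error of the i-th article, with a cycle of period 10."""
    return math.sin(2 * math.pi * i / 10)

n = 1000
population_mean = sum(error(i) for i in range(n)) / n   # close to zero

# Every tenth article: the sample always falls on the same phase of the cycle.
systematic = [error(i) for i in range(3, n, 10)]
systematic_mean = sum(systematic) / len(systematic)

# Two articles chosen at random from each successive lot of 20.
rng = random.Random(0)
chosen = [i for start in range(0, n, 20)
          for i in rng.sample(range(start, start + 20), 2)]
random_mean = sum(error(i) for i in chosen) / len(chosen)

print(f"population mean      {population_mean:+.3f}")
print(f"every tenth article  {systematic_mean:+.3f}")
print(f"random 2 in 20       {random_mean:+.3f}")
```

In this sketch the every-tenth sample reports a mean near the value of the cycle at one fixed phase, however long the run, while the random sample scatters about the true mean.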

4.4. Efficient Formulas for Utilizing Results 

The formulas given in preceding parts of this chapter have properties 
of maximum efficiency and accuracy of interpretation, provided that the 
actual distribution of bomb coordinates under homogeneous conditions 
is of the bivariate normal form. This form of distribution is strongly 
suggested by general mathematical theory based on the idea of the errors
as resultants of a large number of independent causes. It is approximated
by various records of actual bombing, the principal deviation
being a slight skewness of the range errors due to a small proportion of
defective bombs that tumble and fall short. For distributions having
moderate deviation from normality the statistical efficiency of the
methods based on the assumption of normality is still nearly 100 percent.

For the normal distribution in one or more variates, it is known 
that all the information supplied by any random sample with respect 
to the unknown parameters of the distribution is contained in the sample 
means and mean squares of the individual variates, and in the mean 
products of pairs of variates. Only these functions of the observations, 
and functions of these functions, are capable of full efficiency in estimation 
of the unknowns. If any other function of the observations is used for
estimation or for the testing of any hypothesis regarding the unknown
parameters, there is a reduction in accuracy. If the use of an inefficient
statistical method is insisted upon, the loss of accuracy can be made good 
by increasing the number of observations in a proportion inverse to the 
efficiency of the method in question. 

These general principles are illustrated by the use of the mean 
deviation, or mean of absolute values of errors, as a measure of dispersion, 
by comparison with the standard deviation, or by the median in comparison
with the mean. If a variate x, for example the deflection error
of bombs dropped under certain conditions, is normally distributed, this
means that the element of probability is

$$\frac{1}{\sigma\sqrt{2\pi}}\, e^{-\frac{(x-\xi)^2}{2\sigma^2}}\, dx$$

that is, the integral of this expression between any two limits is the
probability of x falling between these limits. This probability distribution
involves two parameters, namely, the mean value ξ and the standard
deviation σ, measuring respectively, in the example of deflection errors,
the deflection component of the mean point of impact of all bombs
dropped under the specified conditions, and dispersion about this mean
point of impact. The square of the standard deviation is known as the
“variance”; this parameter σ² is the mean value of the squares of deviations
from the mean.

When we have N observations x_1, x_2, . . . , x_N of the variate x, we
may calculate from them various functions, each known as a “statistic,”
designed to estimate the unknown parameters ξ and σ. Thus ξ may be
estimated by the arithmetic mean

$$\bar{x} = \frac{x_1 + x_2 + \cdots + x_N}{N}$$

or alternatively by the median m, which is defined as the middle observation
(when the number N is odd) when the observations are arranged in
order of size.* A set of N independent observations is called a “sample.”
If from many samples of N we form the means x̄, these will have a distribution
of their own, which happens to be normal also. The mean of
this new normal distribution has the same value ξ as the old. But instead
of the variance σ², the distribution of means has for its variance the
reduced value σ²/N. This reduction in variance measures the greater
accuracy in averages by comparison with single observations. 

The distribution of medians from samples of N is not of the normal 
form but approximates it if N is large. The variance of this distribution 
of medians is less than that of x but greater than that of the means x̄.
The variance of medians is in fact approximately π/2, or 1.5708, times the
variance of means from samples of the same size when N is large. The 
inefficiency of the median is expressed by this greater dispersion, or by 
considering the relative sample numbers required to obtain the same 
accuracy by the median as by the mean. Taking the latter point of view, 
and letting N_0 be the sample size to be used when the arithmetic mean is
taken as the estimate of ξ, while N_1 observations (for example, bombs)
are to be used when the median is taken instead of the mean, we have for
the two variances

$$\sigma_{\bar{x}}^2 = \frac{\sigma^2}{N_0}, \qquad \sigma_m^2 = \frac{\pi\sigma^2}{2N_1}$$

If the two variances are to be equal, the ratio of the sample numbers is

$$\frac{N_0}{N_1} = \frac{2}{\pi} = 0.6366$$

Consequently, the efficiency of the median is said to be 63.66 percent. 
The efficiency of the mean has been demonstrated to be 100 percent; 
that is, its variance has the smallest possible value. 
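The comparison of the median with the mean can be checked roughly by simulation. The following is a sketch only: the sample size of 51, the number of trials, and the seed are arbitrary assumptions, and for a finite sample the simulated variance ratio should be expected to fall somewhat below the limiting value π/2.

```python
import math
import random
import statistics

def simulate_variance_ratio(n=51, trials=2000, seed=0):
    """Estimate var(median) / var(mean) for normal samples of odd size n."""
    rng = random.Random(seed)
    means, medians = [], []
    for _ in range(trials):
        sample = [rng.gauss(0.0, 1.0) for _ in range(n)]
        means.append(statistics.fmean(sample))
        medians.append(statistics.median(sample))
    return statistics.pvariance(medians) / statistics.pvariance(means)

ratio = simulate_variance_ratio()
print(f"simulated variance ratio: {ratio:.3f}   (limit pi/2 = {math.pi / 2:.4f})")
print(f"efficiency of the median, 2/pi = {2 / math.pi:.4f}")
```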

The situation is similar with respect to estimates of variance. There 
are infinitely many statistics whose mean values are equal to σ². Among
all these, the particular statistic

$$s^2 = \frac{S(x - \bar{x})^2}{N - 1}$$

(where S stands for summation over the sample) enjoys the property
that its distribution has a smaller variance than any of the others. In
particular, when the mean error (that is, the arithmetic mean of the
absolute values of the deviations from the mean) is multiplied by a factor
such that the mean value of the resulting statistic is σ, this statistic has a
distinctly wider dispersion than s when the sample size is the same. Both
these statistics have distributions approaching normality of form when
N is large, with the efficiency ratio

$$\frac{1}{\pi - 2} = 0.87597$$

The mean deviation, despite its wide current use, thus has an efficiency of only
about 87.6 percent. If bombs are dropped for the purpose of ascertaining 
their dispersion, the use of the mean deviation is equivalent to wasting 
about one bomb in eight. 
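The arithmetic behind the "one bomb in eight" statement is short enough to verify directly. The variable names and the figure of 100 bombs below are illustrative only.

```python
import math

# Large-sample efficiency of the mean deviation relative to the standard deviation s.
efficiency = 1 / (math.pi - 2)

# Fraction of the bombs that is, in effect, thrown away by using the mean deviation.
wasted_fraction = 1 - efficiency

# Bombs needed with the mean deviation to match 100 bombs analyzed with s.
bombs_needed = 100 / efficiency

print(f"efficiency      = {efficiency:.5f}")
print(f"wasted fraction = {wasted_fraction:.3f}  (1/8 = {1 / 8:.3f})")
print(f"bombs needed    = {bombs_needed:.1f}")
```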

Against this waste of bombs may be set the fact that the mean devia¬ 
tion is easier to calculate than the standard deviation s. The labor of 
calculating s is, however, much less than is sometimes supposed and is 
trivial if a calculating machine of any kind, or even a table of squares, 
is at hand. The identity 

$$S(x - \bar{x})^2 = Sx^2 - \bar{x}\,Sx$$

means that the individual deviations from the mean need not be com¬ 
puted. In view of the high cost of aerial bombing, it appears definite 
that the use of mean deviations, and of functions of them, ought to be 
eliminated. * 
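As a sketch of how the identity saves labor, the standard deviation s can be computed from the two running totals Sx and Sx² alone. The five deflection errors below are made-up figures used only for the check, and the function names are hypothetical.

```python
import math

def std_dev_shortcut(xs):
    """s from the totals Sx and Sx^2, without forming individual deviations."""
    n = len(xs)
    sx = sum(xs)
    sx2 = sum(x * x for x in xs)
    mean = sx / n
    sum_sq_dev = sx2 - mean * sx          # S(x - mean)^2 = Sx^2 - mean * Sx
    return math.sqrt(sum_sq_dev / (n - 1))

def std_dev_direct(xs):
    """s computed the long way, from the deviations themselves."""
    n = len(xs)
    mean = sum(xs) / n
    return math.sqrt(sum((x - mean) ** 2 for x in xs) / (n - 1))

errors = [3.0, -1.5, 4.2, 0.7, -2.4]      # hypothetical deflection errors, mils
print(std_dev_shortcut(errors), std_dev_direct(errors))
```

On a modern machine the shortcut can lose digits when the mean is very large compared with the spread, so the direct form is the safer choice where rounding matters; for errors measured from the target in mils the two agree closely.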

4.5. Exact Probabilities 

The standard deviation s defined above has another advantage over 
the mean deviation and all other methods of estimating σ in that the
exact distribution of s for samples of every size, even the smallest, is a 
known elementary mathematical function, and tables of its probability 
integral are available. It also has numerous other practical advantages 
in more complicated situations, such as those involving least squares 
or the analysis of variance. In view of the small samples that must 
sometimes be used, the property of having readily determinable exact 
probabilities is a valuable one. Asymptotic formulas are somewhat 
dubious in application to particular cases unless a precise limit of error 
is at hand and are thus inferior to exact probabilities. 

The choice of suitable statistics is also influenced by another and 
more subtle criterion of accurately determinable probabilities. This 
is the desideratum that in testing any hypothesis we should use probabili¬ 
ties that depend on nothing but the observations, together with any 
numbers that may be specified in the hypothesis. Statisticians have 
often, through lack of existence or knowledge of better methods, used 
formulas for standard errors and other measures of accuracy in which 
were involved unknown parameters. The practice has been to substitute 

* See Fisher, “Mean Error and Mean Square Error.” For a discussion of the
general theory of the efficiency of statistics, see Fisher, “Theory of Statistical
Estimation,” and “Statistical Theory of Estimation.”

for these parameters estimates of them obtained from the sample. This
substitution, however, entails an additional sampling error not under control.
Modern statistical theory has succeeded in devising various ways of
overcoming this difficulty. The statistical methods recommended in
this chapter all fulfill the criterion that the associated probabilities are
exact, free from unknown nuisance parameters. This is true in particular
of the measures T_O, T_M, and T_D combining range with deflection errors.
If the new observations are from the same population as the old and if
this has a normal distribution, then the distribution of each of these T’s
is absolutely independent of all unknown parameters. Only the sample 
numbers are involved, and these are of course known. To pass from the 
observations to the relevant exact probabilities through these T’s is a 
purely mathematical operation, involving no element of judgment. It 
is true that the available tables do not cover fully the T distribution, 
so that asymptotic approximations must be used for the present, but this 
is a defect that can be remedied with only a little more mathematical 
research and calculation. The probabilities respecting T_M can be found
easily and accurately from existing published tables when only one flight
is involved, but the combination of these values for different flights leads
to a more general distribution,* which is also applicable to T_D and T_O.

4.6. Reasons for the T Method of Combining Range and Deflection 
Errors 

The particular method of combining range with deflection errors 
represented by the T formulas may be questioned. To many it will seem 
more natural to use measures, apparently simpler, based on the simple 
distances from the bombs to the target. After all, it will be said, the 
ultimate value of a bomb aimed at the enemy depends only on the simple 
geometrical distance from its point of impact to the target, and not on any 
figurative measure of distance that is constant along ellipses instead of 
circles. The simple distance has a clear meaning, intelligible to everyone. 
It can be measured on a map without reading off coordinates. Why not, 
then, use a criterion of excellence of bombing depending, in an inverse 
sense, only on the distances of the bombs from the target? 

The several distances might be combined by simple averaging, or by 
taking the mean square, or in numerous other ways. Use of the mean 
square, or the sum of the squares, of the distances, has a special attrac¬ 
tion in that the sum of the squares of the distances of the bombs from 
the target equals the sum of the squares of their distances from the mean 
point of impact, plus the product of the number of bombs by the square 
of the distance from the mean point of impact to the target. This 

* This mathematical gap has been filled in since the above text was written. The 
results are expected to appear in the Annals of Mathematical Statistics.

relation, which is analogous to

$$T_O^2 = T_M^2 + T_D^2$$

makes it possible to examine separately two components of the total sum 
of squares that may be due to quite different causes. Furthermore, 
estimates of the circular probable error and similar parameters have 
greater statistical efficiency, that is, smaller standard errors for a given 
number of observations, if based on the sum of squares instead of the 
simple sum or mean of the distances. 
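The decomposition of the sum of squared distances can be verified on any set of impacts. The four points below are invented for the purpose, with the target taken as the origin of (range, deflection) coordinates; the function name is hypothetical.

```python
def sum_sq_distances(points, origin):
    """Sum of squared distances of the points from the given origin."""
    ox, oy = origin
    return sum((x - ox) ** 2 + (y - oy) ** 2 for x, y in points)

# Hypothetical impacts as (range error, deflection error), target at (0, 0).
impacts = [(3.0, -2.0), (-1.0, 4.0), (5.0, 1.0), (1.0, 3.0)]
n = len(impacts)
mpi = (sum(x for x, _ in impacts) / n, sum(y for _, y in impacts) / n)

total = sum_sq_distances(impacts, (0.0, 0.0))    # about the target
about_mpi = sum_sq_distances(impacts, mpi)       # about the mean point of impact
bias_part = n * (mpi[0] ** 2 + mpi[1] ** 2)      # N times squared distance MPI to target

print(total, "=", about_mpi, "+", bias_part)
```

The two components on the right can then be examined separately, one reflecting dispersion and the other the displacement of the mean point of impact.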

But the efficiency even of the mean square circular error for estimating 
the general degree of dispersion is less than that of T_O or T_D, provided
the base period sample is representative of the true distribution as
regards the ratios of the variances of range and deflection, and as regards
the correlation. If the true mean point of impact is unknown or if we
choose to ignore it, the best available measure of general degree of dispersion
is T_D or a function of T_D. If the true mean point of impact could
be assumed to be at the target, the best measure of general degree of
dispersion would be T_O or a function of T_O. Such measures have smaller
variances than measures of the same parameter based on the mean square 
circular error in a ratio which, if we can trust the experience described 
in Sec. 5, is slightly less than 82 percent. This implies that the estima¬ 
tion of dispersion can be carried out with about equal accuracy with 82 
bombs if T_D or T_O is used, or with 100 bombs if the mean square circular
error is used. The simple mean of distances from the target or from the 
MPI, or any estimate involving mean deviations, would require distinctly 
more than 100 bombs to gain the same accuracy obtainable with 82 
bombs when T is used. 

The seemingly paradoxical result emerges that, to obtain the best 
possible estimate of the mean square deviation from the target to be 
expected in later bombing under like conditions, the mean square devia¬ 
tion from the target in the observed cases should not be used. Likewise, 
to estimate the mean deviation from the target as accurately as possible 
for the future, the mean deviation from the target in the past should not 
be used. The same is true of shooting and of all other situations in 
which the joint distribution of two or more variates does not have 
circular or spherical symmetry. Indeed, for any given set of ratios 
among the variances and covariances, the mean square deviation from 
the target and the mean deviation from the target for the population of 
bombs or shots, from which our observations constitute a sample, are 
definite multiples of a parameter measuring general degree of dispersion; 
and such a parameter is best estimated by a quadratic expression of the 
T² type.

For detection of bias in the mean point of impact, the use of T_M
has advantages over all other methods that have been studied.* It is
true that when only a bias in a specific direction, such as that of range 
errors, is to be considered and it is certain that there is no bias in any other 
direction, the mean of the components of deviation (positive and negative) 
in the particular direction is the most efficient measure, while the use of a 
measure of bias in all directions weakens somewhat the test for a bias 
in the particular direction. This difficulty is, however, inescapable 
when there is any real importance and likelihood of errors of bias in more 
than one direction. If a rule were established that a bomb should be 
classed as “bad” if its range error were such as to be exceeded with 
probability 0.01 and it should also be classified as bad if the deflection 
error were such as to be exceeded with probability 0.01, there would be 
some danger of misunderstanding. Many persons would suppose that 
the probability of a good bomb being wrongly classified as bad under 
such circumstances is 0.01, whereas actually this probability is in the 
neighborhood of 0.02. Such erroneous ideas abound in statistical 
practice in many fields. Use of the combined measure T_M avoids this
difficulty. 
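The figure of about 0.02 follows from a one-line calculation if, as an assumption made here for illustration, the range and deflection errors of a good bomb are treated as independent. The function name is hypothetical.

```python
def false_alarm_probability(alpha, tests=2):
    """Chance that at least one of several independent tests rejects a good item."""
    return 1 - (1 - alpha) ** tests

p_bad = false_alarm_probability(0.01)   # separate range and deflection rules
print(f"probability of wrongly classing a good bomb as bad: {p_bad:.4f}")
```

With correlated errors the figure would differ somewhat, but it would still lie between 0.01 and 0.02 rather than at the nominal 0.01.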

The T statistics are algebraic invariants and as such possess certain 
desirable properties. Any transformation of coordinates such as a 
rotation of axes or a change in units of measure, if applied both to the 
old and to the new observations, leaves each of the T’s invariant and 
therefore does not affect the probabilities involved. This same property 
of invariance is also important in obtaining the relevant probability 
formulas. It implies the very important fact that the probabilities 
associated with the T’s are exact numbers, not functions of nuisance 
parameters whose values are never exactly known. 
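The invariance property can be illustrated numerically with a quadratic form of the general kind to which the T statistics belong. The sketch below is not the chapter's definition of any particular T: it assumes only a form v'C⁻¹v, where C is the covariance matrix of the old observations and v a new observation measured from the target. The data, the 25-degree rotation, and the unequal changes of units are all hypothetical.

```python
import math

def covariance(points):
    """Sample variances and covariance (divisor n - 1) of 2-D points."""
    n = len(points)
    mx = sum(p[0] for p in points) / n
    my = sum(p[1] for p in points) / n
    sxx = sum((p[0] - mx) ** 2 for p in points) / (n - 1)
    syy = sum((p[1] - my) ** 2 for p in points) / (n - 1)
    sxy = sum((p[0] - mx) * (p[1] - my) for p in points) / (n - 1)
    return sxx, sxy, syy

def quadratic_form(v, cov):
    """v' C^{-1} v for the symmetric 2x2 matrix C = [[sxx, sxy], [sxy, syy]]."""
    sxx, sxy, syy = cov
    det = sxx * syy - sxy * sxy
    return (syy * v[0] ** 2 - 2 * sxy * v[0] * v[1] + sxx * v[1] ** 2) / det

def transform(p, a):
    """Apply the 2x2 matrix a to the point p."""
    return (a[0][0] * p[0] + a[0][1] * p[1], a[1][0] * p[0] + a[1][1] * p[1])

old = [(3.0, -2.0), (-1.0, 4.0), (5.0, 1.0), (1.0, 3.0), (-4.0, -6.0)]
new = (2.0, -1.0)

theta = math.radians(25)
# A rotation of axes followed by different changes of unit on the two axes.
a = [[2.0 * math.cos(theta), -2.0 * math.sin(theta)],
     [0.5 * math.sin(theta), 0.5 * math.cos(theta)]]

q_before = quadratic_form(new, covariance(old))
q_after = quadratic_form(transform(new, a), covariance([transform(p, a) for p in old]))
print(q_before, q_after)
```

The two values agree because the transformation is applied to the old and the new observations alike, which is the condition stated in the text.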

There may be a thought that instead of T another invariant known 
as the “generalized variance” might be used.† The generalized variance
has many important potential uses. However, it is not so suitable as
T for the purposes here in view because, in two dimensions, it is essentially
a measure of distance (in a generalized sense) from a line instead of from
a point. While T_M and T_O can be zero only if all the bombs fall on the
target, and T_D can be zero only if all the bombs fall on one point, the
generalized variance will be zero if all the bombs fall on any one line, no
matter how they are scattered along it. This consideration also eliminates
for the present purpose several other statistics obtained by maximum
likelihood methods.‡
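The behavior of the generalized variance on collinear impacts can be seen in a short check. Here the generalized variance is taken, as is usual, to be the determinant of the sample covariance matrix; the points are hypothetical.

```python
def generalized_variance(points):
    """Determinant of the 2x2 sample covariance matrix (divisor n - 1)."""
    n = len(points)
    mx = sum(p[0] for p in points) / n
    my = sum(p[1] for p in points) / n
    sxx = sum((p[0] - mx) ** 2 for p in points) / (n - 1)
    syy = sum((p[1] - my) ** 2 for p in points) / (n - 1)
    sxy = sum((p[0] - mx) * (p[1] - my) for p in points) / (n - 1)
    return sxx * syy - sxy * sxy

# Impacts widely scattered along one line, far from any single point.
on_a_line = [(t, 2 * t + 1) for t in (-3.0, 0.0, 1.0, 4.0)]
# Impacts scattered in both directions.
scattered = [(3.0, -2.0), (-1.0, 4.0), (5.0, 1.0), (1.0, 3.0)]

print(generalized_variance(on_a_line))   # zero apart from rounding
print(generalized_variance(scattered))   # positive
```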

An objection likely to be made to the T formulas is against the inclu¬ 
sion of the term in the product of range by deflection errors. If the 

* See Hsu, “Generalized T”; Wald, “Tests Concerning Several Parameters.”

† Wilks, “Generalizations in the Analysis of Variance.”

‡ Pearson and Wilks, “k Samples of Two Variables.”

correlation between range and deflection in the sample in the base period
were exactly zero, this term would be zero. It may be argued on a priori 
grounds that the true correlation is zero, and that when, as in the sample 
here used, the observed correlation is only slightly greater than its standard
error (Sec. 5 gives, after adjustments, a correlation of −0.07535, which
is 1.2 times its standard error), later work should proceed on the fixed 
assumption that the correlation is zero. A further extension of this idea 
might lead to ignoring the difference between the variances of range and 
deflection, and consequently using the mean square deviation, whose 
inefficiency has been described above. 

While omission of the small product term from T² would seem at
first to save a little computational labor, it would lead to great mathematical
complexities in connection with the probabilities. The resulting
function is not an invariant, and its distribution involves as a nuisance
parameter the ratio of range variance to deflection variance in the population,
which is estimated by the sample only with an uncertainty that
makes highly dubious any probabilities calculated on this basis. More¬ 
over, if it should turn out that there really is some correlation between 
range and deflection errors, whether the causes and magnitude of such 
correlation are known or unknown, this correlation would constitute an 
additional nuisance parameter having serious effects on the accuracy 
of judgments based on the observations. In this connection it should 
be recalled that, while the particular sample here considered does not 
establish existence of a correlation, other reports seem, at least super¬ 
ficially, to show high and significant correlations between range and 
deflection errors in practice bombing. Some officers are of the opinion 
that a real correlation exists. 

Another way of avoiding the calculation of products of range by 
deflection errors would be to make separate tests of significance for range 
and deflection, using the variance ratio distribution, and then to combine 
these by the method of combining independent test probabilities pro¬ 
pounded by Fisher and studied by Wallis and others.* This method 
requires use of the exact probabilities rather than of the values of the 
variance ratio at a few points such as those for probabilities 0.01 and 
0.05. Consequently, the tables of the F distribution must be used twice 
for every such test and thus for every point plotted on control charts 
corresponding to those for T_M, T_D, and T_O. The labor involved would
amount to far more, and the assumptions necessary for validity are more 
stringent, by this than by the T methods. 
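For reference, Fisher's method of combining independent test probabilities takes minus twice the sum of their natural logarithms and refers it to the chi-square distribution with two degrees of freedom for each probability combined. A minimal sketch follows; the function name and the two input probabilities of 0.05 are hypothetical, and the closed form used for the chi-square tail applies because the number of degrees of freedom is even.

```python
import math

def fisher_combined_probability(probabilities):
    """Combine independent test probabilities by Fisher's method."""
    k = len(probabilities)
    # half_x is x / 2, where x = -2 * sum(ln p) is the chi-square criterion.
    half_x = -sum(math.log(p) for p in probabilities)
    # Upper tail of chi-square with 2k degrees of freedom (closed form, even df).
    return math.exp(-half_x) * sum(half_x ** j / math.factorial(j) for j in range(k))

# Hypothetical exact probabilities from separate range and deflection tests.
combined = fisher_combined_probability([0.05, 0.05])
print(f"combined probability: {combined:.5f}")
```

The need for the exact probability of each separate test, rather than a statement that it passed the 0.05 or 0.01 point, is visible in the inputs the function requires.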

The methods based on T are valid and accurate whether a true correlation
exists or not, and whatever the ratio of the true variances may be.

* Fisher, Statistical Methods, Sec. 21.1 of Chap. 4; Wallis, “Compounding
Probabilities.”

The only limitations on this statement are in the assumptions of normality
and independence, which are made throughout, not only in the present 
discussion but in virtually the whole literature of bombing and firing, 
including that concerned with methods of combining observations. 
These assumptions, in relation to test bombing, are discussed in Sec. 
3.1.4, pp. 146-147. 

5. STATISTICAL ANALYSIS OF CERTAIN TEST BOMBINGS 
5.1. Summary of the Data 

In order to obtain variances in range and deflection, and a correlation, 
from which the coefficients a, b, c in T² could be found, a series of 65
flights was analyzed. These consisted of all those flights at a certain 
proving ground between November 4, 1942, and December 23, 1942, 
during which exactly four bombs were dropped. The coordinates 
of the point of impact of each bomb with respect to axes through the 
target, of which one is an average track for the whole flight, were supplied 
by the proving ground, together with the heading of the airplane when 
the bomb was dropped and other data. Two of the bombs on each 
flight were dropped while the airplane was moving in a generally northerly 
direction and the other two while it was flying in a southerly direction. 
The actual headings, however, varied considerably, the difference between 
the two in the same general direction on the same flight being commonly 
several degrees and in one case amounting to 25 degrees. We therefore 
applied to the coordinates received a rotation designed to give the errors 
in range and deflection with respect to the actual heading of the airplane. 
Table 3.1 gives the principal statistics summarizing both the original 
data on range and deflection and the values as adjusted by these rotations. 
Except for the first column of this one table, all numerical results given 
hereinafter will refer to the rotated values. In each case r denotes the 
error in range and d the error in deflection. Positive values of r refer to 
bombs overshooting the target, negative values of r to bombs that fall 
short. Deflection errors to the right are considered positive; those to 
the left are negative. The unit of measure is in each case the mil, 
about 1/1,000 of a radian, or about 2/35 of a degree. The data as 
received refer to actual impacts, uncorrected for differential ballistic 
wind, nonstandard altitude, nonstandard air speed, or range component 
of crosstrail. The actual altitudes are understood to have fluctuated 
rather widely about 7,000 ft., but since the errors are measured in mils 
the effects of these altitude variations on the results of the analysis 
should not be great. 

If any one rotation had been applied to all the points of impact, 
the squared distance r̄² + d̄² of the mean point of impact from the target
would be invariant. The fact that r̄² + d̄² has two different values, 8.83
for the original and 9.67 for the rotated data, results from the differences
of the rotations applied for the different bombs.
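The point about rotations can be checked on made-up data: one common rotation leaves the squared distance of the mean point of impact from the target unchanged, whereas a different rotation for each bomb does not. The coordinates and angles below are hypothetical and are not taken from the proving-ground records.

```python
import math

def rotate(point, degrees):
    """Rotate a point about the target (the origin) by the given angle."""
    a = math.radians(degrees)
    x, y = point
    return (x * math.cos(a) - y * math.sin(a), x * math.sin(a) + y * math.cos(a))

def mpi_sq_distance(points):
    """Squared distance of the mean point of impact from the target."""
    n = len(points)
    mx = sum(p[0] for p in points) / n
    my = sum(p[1] for p in points) / n
    return mx ** 2 + my ** 2

impacts = [(3.0, -2.0), (-1.0, 4.0), (5.0, 1.0), (1.0, 3.0)]
common = [rotate(p, 10) for p in impacts]                  # same angle for all
separate = [rotate(p, a) for p, a in zip(impacts, [10, -5, 25, 0])]

print(mpi_sq_distance(impacts), mpi_sq_distance(common), mpi_sq_distance(separate))
```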


TABLE 3.1

Summary of Bombing Errors in 65 Flights (260 Bombs)

                                                    Original    Rotated
                                                     (mils)      (mils)

Mean range error* (over), r̄ .....................    0.5108      1.3450
Mean deflection error* (right), d̄ ...............    2.9292      2.8046
Standard deviation of range errors, s_r ..........    6.8596      6.9074
Standard deviation of deflection errors, s_d .....    8.7195      8.6191
Correlation of range with deflection errors ......   -0.037      -0.035

* These are means of positive and negative errors, not of their absolute values. The first two lines in
Table 3.1 would give the coordinates of the mean point of impact (MPI) of all the 260 bombs if the same
course had been flown in each case. They reflect biased error, whereas the next two lines give measures
of the unbiased or compensating errors, often measured less accurately by the “mean deviation,” or
mean of absolute values of errors, which, to be comparable with the standard deviation, would need to
be multiplied by a factor that for large samples approximates √(π/2) = 1.253.

Both the mean range error and the mean deflection error (that is, 
the means of the actual values of the errors, positive and negative, 
not of the absolute values) calculated from the rotated data are statis¬ 
tically significant relative to their standard errors. Thus it is clear that 
there were biased errors throwing the mean point of impact (MPI) beyond 
and to the right of the target. The bias in deflection could have been 
detected from the original, unrotated observations, but in them the bias 
in range, being smaller, was obscured by the addition of the variance 
among the four headings on the same flight to the variance due to other 
causes. 

As in various other bombing records, the standard deviation in 
deflection is greater than that in range. The excess is 24.8 percent in 
the present sample and is statistically significant. The appropriate 
test is in terms of F = (s_d/s_r)², which equals 1.6158 for the original
and 1.5570 for the rotated data. With the 259 degrees of freedom in each 
of the variance estimates, the probability that F should exceed 1.470 is 
0.001. When, as here, the greater variance is divided by the smaller, 
the probability is less than 1 in 500 of obtaining so great a ratio of stand¬ 
ard deviations as that actually observed if the true standard deviations 
are equal. At the customary 0.05 probability level, confidence limits 
for the percentage by which the standard deviation in deflection exceeds 
that in range are 12.6 percent and 38.2 percent. 
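The variance ratios and the percentage excess quoted in this paragraph can be reproduced from the standard deviations given in Table 3.1. The dictionary layout and variable names below are illustrative only; the tabled probability for F is taken from the text, not recomputed.

```python
# Standard deviations in mils, as given in Table 3.1.
s_range = {"original": 6.8596, "rotated": 6.9074}
s_deflection = {"original": 8.7195, "rotated": 8.6191}

for key in ("original", "rotated"):
    ratio = s_deflection[key] / s_range[key]
    variance_ratio = ratio ** 2              # F, with 259 degrees of freedom each
    excess_percent = 100 * (ratio - 1)       # excess of deflection over range
    print(f"{key:9s} F = {variance_ratio:.4f}   excess = {excess_percent:.1f} percent")

# Taking the greater variance as numerator doubles the tabled one-tail probability.
two_sided_probability = 2 * 0.001            # that is, 1 in 500
```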

The correlations −0.037 and −0.035 are completely insignificant,
since their standard error is 1/√259 = 0.0621. This of course does not
definitely prove that range and deflection errors are mutually independent;
it only fails to disprove it.

Among the 65 flights there were variations with respect to both
personnel and equipment. Ten different teams, each consisting of a pilot 
and a bombardier, were represented. The number of flights made by a 
team varied from 1 to 17 in the several cases. Automatic Erection 
System (AES) was used in 24 cases and Stabilized Bombing Approach 
Equipment (SBAE) in 8 cases. The distribution of the flights in relation 
to personnel and equipment is shown in Table 3.2. The first columns 
designate the pilot and the bombardier by their initials and indicate the 
combination of the two by a number. 


TABLE 3.2

Distribution of Flights by Personnel and Equipment

                                            Number of cases
                              ------------------------------------------------
                     Team     Neither AES    AES     SBAE    Both AES   Total
Pilot   Bombardier   number   nor SBAE       only    only    and SBAE   for team

WT      H              1          10           5       2                   17
W       S              2           6                   1         3         10
R       R              3           4           3                            7
W       H              4           1                   1         1          3
R       H              5           7           4                           11
K       U              6           1                                        1
K       H              7           4           4                            8
K       L              8           2           2                            4
W       R              9           1           2                            3
K       R             10           1                                        1

Totals ....................       37          20       4         4         65
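The internal consistency of Table 3.2 can be confirmed mechanically. In the sketch below blank cells are read as zero; the initials shown for teams 7 and 8 are the most plausible reading of an unclear print and should be treated as an assumption, though they do not affect the totals.

```python
# (pilot, bombardier, team, neither, AES only, SBAE only, both AES and SBAE)
rows = [
    ("WT", "H", 1, 10, 5, 2, 0),
    ("W", "S", 2, 6, 0, 1, 3),
    ("R", "R", 3, 4, 3, 0, 0),
    ("W", "H", 4, 1, 0, 1, 1),
    ("R", "H", 5, 7, 4, 0, 0),
    ("K", "U", 6, 1, 0, 0, 0),
    ("K", "H", 7, 4, 4, 0, 0),   # bombardier initial assumed
    ("K", "L", 8, 2, 2, 0, 0),   # initials and team number assumed
    ("W", "R", 9, 1, 2, 0, 0),
    ("K", "R", 10, 1, 0, 0, 0),
]

column_totals = [sum(row[i] for row in rows) for i in range(3, 7)]
team_totals = [sum(row[3:]) for row in rows]
flights = sum(team_totals)
aes_flights = column_totals[1] + column_totals[3]    # AES only + both
sbae_flights = column_totals[2] + column_totals[3]   # SBAE only + both

print(column_totals, flights, aes_flights, sbae_flights, 4 * flights)
```

The totals agree with the statements in the text that AES was used on 24 flights and SBAE on 8, and that the 65 flights account for 260 bombs.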


5.2. Analysis of the Data 

The primary object of the analysis was to obtain suitable coefficients
a, b, and c for use in the expressions for T² that are to measure the accuracy
of individual bombsights. However, the variances entering into
these expressions are presumably affected by the variations of equipment 
and of personnel among the flights examined. It is to be expected that 
this heterogeneity will increase the sample variances above the values 
they would have if equipment and personnel had been kept constant on 
all the flights. Such an increase of variance would reduce the values of 
a, b, and c, and therefore the values of T² computed from them in future
tests, giving an illusory impression of accuracy in those tests in which 
equipment and personnel are close to the average situation in the cases 
analyzed. To determine whether this is a serious bias and to correct 
for it if it is, it seemed desirable to consider regression equations for 
predicting the range and deflection errors in terms of equipment and 
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personnel. At the same time, the opportunity was at hand to obtain 
tentative estimates characterizing the performance of particular teams 
and of the particular types of equipment and to test the accuracy of 
these estimates. 

Several different regression equations were used. In the first instance,
to make a rather extensive elimination of extraneous factors, we
considered regression formulas with no fewer than 79 terms. Besides an
additive constant, these terms were

$x_1$, defined as 1 if AES was used, and otherwise 0

$x_2$, defined as 1 if SBAE was used, and otherwise 0

$x_3$, defined as 1 if the airplane was flying north, but 0 if it was
flying south

$x'_i$ ($i = 1, 2, \ldots, 10$), equal to 1 if the bomb was dropped by the
$i$th team, and otherwise 0. The number of variables in this
group was of course 10, the number of teams.

$x''_{ij}$ ($i = 1, 2, \ldots, 10$), equal to 1 if the bomb was dropped on
the $j$th of the flights of the $i$th team, and otherwise 0. The number
of variables in this group was 65, the same as the number of flights.

Two different predictands, the range and deflection errors, were used with
this same set of predictors. The regression equation for estimating
range error in terms of all the predictors was of the form

$$R = b_0 + b_1 x_1 + b_2 x_2 + b_3 x_3 + \sum_{i=1}^{10} b'_i x'_i + \sum_{i=1}^{10} \sum_{j} b''_{ij} x''_{ij}$$

with a similar formula for estimated deflection error $D$. The coefficients
were determined by the method of least squares.

It might seem that the solution of a least-square problem with 79 
unknowns would be an utterly fantastic task, to say nothing of two such 
solutions. Yet through special circumstances it was possible not only to 
carry through this calculation completely, with the relevant standard 
errors and tests for significance in terms of probability, but to do so 
with a very modest amount of computing. One of these circumstances 
was that, because of the nature of the variables, a large majority of the 
elements of the matrix of the normal equations were zero. Also, a 
simplification was obtained through choice of 13 linear relations that 
can be imposed somewhat arbitrarily upon the regression coefficients 
because of 13 identical linear relations among the predictors, which 
reduce the rank of the 79-rowed matrix to 66. These 13 identical
relations express the following facts:

The sum of the values of $x''_{ij}$ for all the flights of the $i$th team
equals $x'_i$. There are 10 of these relations, corresponding to the 10
teams.

The sum $x'_1 + x'_2 + \cdots + x'_{10} = 1$

The sum of the values of $x''_{ij}$ for all flights using AES is $x_1$

The sum of the values of $x''_{ij}$ for all flights using SBAE is $x_2$
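The count of 13 identical relations, and the resulting rank of 66, can be verified by building the 79-column matrix of predictors from Table 3.2. The sketch below does so in Python with exact rational arithmetic. It assumes that blank cells of Table 3.2 are zeros and, since the text records only that north and south headings were balanced, that each flight dropped two bombs on each heading. The function names are illustrative.

```python
from fractions import Fraction

# Flights per team from Table 3.2: (neither, AES only, SBAE only, both).
FLIGHTS = [(10, 5, 2, 0), (6, 0, 1, 3), (4, 3, 0, 0), (1, 0, 1, 1), (7, 4, 0, 0),
           (1, 0, 0, 0), (4, 4, 0, 0), (2, 2, 0, 0), (1, 2, 0, 0), (1, 0, 0, 0)]
EQUIPMENT = [(0, 0), (1, 0), (0, 1), (1, 1)]  # (x1, x2) for each column above


def design_matrix(headings=(1, 1, 0, 0)):
    """One row per bomb: constant, x1, x2, x3, 10 team and 65 flight indicators.

    `headings` gives x3 for the four bombs of a flight. It is an assumption
    of this sketch, not a detail recorded in the text.
    """
    n_flights = sum(sum(counts) for counts in FLIGHTS)
    rows, flight = [], 0
    for team, counts in enumerate(FLIGHTS):
        for (x1, x2), count in zip(EQUIPMENT, counts):
            for _ in range(count):
                team_part = [int(t == team) for t in range(len(FLIGHTS))]
                flight_part = [int(f == flight) for f in range(n_flights)]
                for x3 in headings:
                    rows.append([1, x1, x2, x3] + team_part + flight_part)
                flight += 1
    return rows


def rank(matrix):
    """Exact rank by Gaussian elimination (duplicate rows are dropped first)."""
    rows = [[Fraction(v) for v in row] for row in set(map(tuple, matrix))]
    pivots = 0
    for col in range(len(rows[0])):
        hit = next((i for i in range(pivots, len(rows)) if rows[i][col] != 0), None)
        if hit is None:
            continue
        rows[pivots], rows[hit] = rows[hit], rows[pivots]
        for i in range(pivots + 1, len(rows)):
            if rows[i][col] != 0:
                factor = rows[i][col] / rows[pivots][col]
                rows[i] = [a - factor * b for a, b in zip(rows[i], rows[pivots])]
        pivots += 1
    return pivots


X = design_matrix()
n_obs, n_terms = len(X), len(X[0])
r = rank(X)
print(n_obs, n_terms, r)       # observations, terms, rank
print(n_terms - r, n_obs - r)  # identities among predictors, residual d.f.
```

Under these assumptions the deficiency of the matrix is 79 - 66 = 13, and the residual degrees of freedom are 260 - 66 = 194.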

A linear constraint corresponding to each of these was placed upon
the regression coefficients in each group in order to make them
determinate. Though it is customary in problems of this kind to use the
simplest constraints by merely equating to zero the sum of the coefficients
in each group, we chose a different set of constraints adapted to
simplifying the numerical work. For example, where a constraint of the form

$$b'_1 + b'_2 + \cdots + b'_{10} = 0$$

would ordinarily be imposed, we used instead

$$17 b'_1 + 10 b'_2 + 7 b'_3 + \cdots + b'_{10} = 0$$

with coefficients equal to the numbers of flights of the respective teams.
This made possible a great reduction in labor because of the occurrence of
this particular linear combination in the normal equations. All results of
importance involving the $b'_i$ will necessarily be in terms of their
differences and do not seem to be affected by replacing one such constraint
by another.

The solution of the normal and constraining equations was carried 
out literally, in terms of symbols for the right-hand members of the normal 
equations, instead of numerically. In this way it was easy to obtain 
the two regression equations, one for range and the other for deflection 
errors, by substituting the two sets of right-hand members. Furthermore,
the literal solution made easy the determination of standard
errors and correlations among the regression coefficients in either equation. 
Thus it played somewhat the same role as that of the inverse matrix in 
least-square problems involving no identical relations or constraints. 

The question whether there was significant variation among the 10 
teams with regard to bias in range and deflection errors was tested by 
means of the variance ratio distribution. The denominator of this ratio 
was the estimate of variance based on the sum of squares of residuals 
from the regression equation. Since there were 260 observations and 79 
regression coefficients connected by 13 constraints, the number of degrees 
of freedom for this residual variance was $260 - (79 - 13) = 194$. The
numerator was a quadratic form in the $b'_i$ with 9 degrees of freedom,
since they are subject to the one relation indicated above. The value
of such a quadratic form can be found by first ignoring any one of the
$b'_i$ (which we arbitrarily chose as $b'_1$), inverting* the covariance
matrix of the other nine corresponding to unit variance of residuals, and
then calculating straightforwardly the quadratic form of which this inverse
is the matrix and in which the variables are the nine $b'_i$ used. This
has only to be divided by 9 to give the numerator of the variance ratio,
essentially an estimate of variance among the $b'_i$, but differing from a
mere sum of squares of their deviations from their mean because of their
unequal weights and correlations resulting from the unbalanced
distribution of equipment and personnel. The inverse matrix is the same
for range as for deflection errors, since it depends only on the
distribution of the predictors and not on the predictands.

* Several methods of numerical inversion of matrices are described in Dwyer,
“Solution of Simultaneous Equations” and Hotelling, “Matrix Calculation.” The
method given by Dwyer is particularly valuable.

For range errors the variance ratio $F$ found in this way was 1.423;
for deflection errors it was 1.426. These are considerably less than the
value 1.923 of $F$ that would just be regarded as significant on the
customary 0.05 level and indeed are little greater than the value that
would be exceeded by chance with probability 0.20 if the true mean range
and deflection deviations were the same for all teams. Thus no evidence
pointing to any variation among teams in these biased errors emerges
from this test.

With two variates such as range and deflection errors the two separate
tests may, with a substantial increase in accuracy, be replaced by a
single test. In place of the foregoing variance ratios we may well
consider a combined test due to Wilks.* Each variance is replaced in
this test by a symmetric two-rowed determinant. The elements of the
denominator determinant are the variances and the covariance of range
and deflection as estimated from the residuals from the two regression
equations in which they are the predictands. The numerator determinant's
elements are the two quadratic forms computed from the two
sets of $b'_i$, and the bilinear form with the same matrix and involving
both sets of $b'_i$. The value 1.696 obtained for this generalized variance
ratio would be exceeded by chance with a probability considerably greater
than 0.05, though much further below 0.20 than in the separate variance
ratios for range and deflection. Thus even this more sensitive test
fails to give real evidence of variation in mean point of impact among
the men engaged in dropping the bombs. That there is a real bias, at
least in deflection, is easily inferred from Table 3.1 or from other bombing
records. Apparently the bias is much the same for all the 10 teams on
whose performance these records are based.

* Wilks, “Generalizations in the Analysis of Variance.”

Application of the corresponding tests for the effect on the MPI
of AES, of SBAE, and of direction of flight was much simpler because each
of these effects corresponds to only 1 degree of freedom, so that the test
for either range or deflection reduces merely to the comparison of a
regression coefficient with its standard error. When AES was used, the MPI
was 1.6536 mils farther in the “short” direction than if it had not
been used, according to the regression coefficient, which makes
allowances for effects of SBAE, direction of flight, and personnel. This
coefficient is statistically significant on the customary 0.05 probability
level. Since the MPI for all the bombs was 1.3450 mils “over,” the use
of AES appears to have been advantageous in this respect. The
corresponding coefficient for deflection was not significant, and the
combined test by Wilks's ratio of determinants also failed to yield a
significant result on the 0.05 level. The same tests applied to SBAE gave
results far from significant, suggesting no difference according as this
equipment was used or not.

The effect of direction of flight on MPI was tested by means of the
coefficient $b_3$ in the 79-term regression equation. For range the effect
was insignificant. But for deflection the value $-4.1045$ was significant
even on the 0.01 level. This indicates that the bias toward the right was
considerably less flying north than flying south.

The variations of MPI among flights were tested by an approximate
method, simpler than that used for personnel variance, which would for
65 flights with 53 degrees of freedom have involved the considerable labor
of inverting a matrix of 53 rows. Such a piece of calculation is not so
completely out of the question as would once have been thought, but it
did not seem worth while for the present purpose. Ignoring the
unbalance occasioned by disproportionate frequencies of equipment and
personnel, while utilizing the balance of north and south flights and the
fact that exactly four bombs were dropped on each flight, leads to the
estimate $4 \sum (b''_{ij})^2 / 53$ of variance among flights. This
turns out to be insignificant for deflection but significant for range.
The combined test for range and deflection failed to yield a significant
result. Thus any important effect on MPI of variations among flights
seems to be confined to the range components. Such variations refer
to effects of varying weather and other factors not taken into account
in the regression equation but do not include the variations of personnel
and equipment considered.

Since it is to be expected that pilots and bombardiers will change 
from time to time, as will also the weather, the variances of deviations 
from the 79-term regression functions will be less relevant than if the 
effects of the fluctuations had not been eliminated. If the personnel 
were to remain constant, and of constant ability, and if an adequate 
measure of seasonal variation were available from records over a period of 
years based on ore-filled bombs, the appropriate estimates of variances 
and correlation for appraising bombsights under test would be in terms of 
deviations from a regression equation involving all these factors. The
omission of the terms in the regression function corresponding to
personnel may also be justified on the ground that, as seen above, no
discernible differences between the MPI's of different teams have been
found.

Inclusion of an allowance for equipment type does seem to be needed,
first because of the highly significant difference in MPI associated with
AES, and second because of the prospect that future standardization
will lead either to the use or the nonuse of each of the types of equipment
with all sights tested alike. Furthermore, it is to be hoped that methods
will be found to eliminate the difference between the MPI of bombs
dropped on northward and on southward flights. The values recommended
for the coefficients $a$, $b$, and $c$ of the test functions $T^2$ to be
used for future control charts have therefore been based on the residuals
from regression functions with only four terms. Besides a constant,
these terms involved

$x_1$, which equals 1 if AES was used and is otherwise 0

$x_2$, which equals 1 if SBAE was used and is otherwise 0

$x_3$, which equals 1 if heading was north, but 0 if heading was south

These regression equations were found to be, for range errors in mils over,

$$R = 2.9125 - 1.4457 x_1 - 2.8149 x_2 - 1.3746 x_3$$

and for deflection, measured in mils right,

$$D = 5.4099 - 0.8106 x_1 - 1.0176 x_2 - 4.3615 x_3$$

The deviations from either of these regression equations are 260
in number but have only 256 degrees of freedom because of the four
constants in the equation, in accordance with well-known least-square
theory. The sum of squares of these deviations from one of the
equations, divided by 256, gives the estimated variance in range or
deflection. In the same way, the sum of products of deviations from the
two equations, divided by 256, gives the estimated covariance. These
values are

Range variance        $s_r^2 = s_{11} = 46.294995$

Deflection variance   $s_d^2 = s_{22} = 70.033392$

Covariance            $s_{rd} = s_{12} = -4.290599$

From these values a simple calculation gives

$$s_r = 6.8040 \text{ mils} \qquad s_d = 8.3686 \text{ mils} \qquad r = -0.07535$$

the estimated values mentioned in previous parts of this chapter of the
true standard deviations $\sigma_r$ and $\sigma_d$ and correlation $\rho$.
The number 256 of degrees of freedom is large enough to make the estimates
$s_r$ and $s_d$ quite reliable. The standard error of each of these
standard deviations is only about 4.42 percent of its value. That of $r$
is exactly 1/16, making the correlation only 1.206 times its standard
error. The probability of so great a value of $|r|$ in the absence of any
real correlation is about 0.23, so that the hypothesis of independence
between range and deflection is not refuted by these data, though there is
a faint suggestion that bombs that fall short tend to fall to the right,
while those going over the target tend to fall to the left.
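The figures in this paragraph follow from the three estimates by simple arithmetic, which may be retraced as in the Python sketch below. The large-sample formulas used for the standard errors, and the normal approximation for the probability, are assumptions of the sketch chosen because they reproduce the quoted values; the variable names are illustrative.

```python
from math import erfc, sqrt

S11, S22, S12 = 46.294995, 70.033392, -4.290599  # estimates quoted in the text
DF = 256                                          # degrees of freedom of the residuals

s_r = sqrt(S11)           # standard deviation of range errors, mils
s_d = sqrt(S22)           # standard deviation of deflection errors, mils
r = S12 / (s_r * s_d)     # sample correlation

# Assumed large-sample approximations:
rel_se_sd = 1 / sqrt(2 * DF)   # standard error of s as a fraction of s
se_r = 1 / sqrt(DF)            # standard error of r when the true correlation is 0
ratio = abs(r) / se_r
p_two_sided = erfc(ratio / sqrt(2))   # normal approximation, both tails

print(round(s_r, 4), round(s_d, 4), round(r, 5))
print(round(100 * rel_se_sd, 2), se_r, round(ratio, 3), round(p_two_sided, 2))
```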

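The inverse of the matrix

$$\begin{bmatrix} s_{11} & s_{12} \\ s_{12} & s_{22} \end{bmatrix}$$

is

$$\begin{bmatrix} a & b \\ b & c \end{bmatrix}$$

and from the above values we find

$$a = 0.021724 \qquad b = 0.001331 \qquad c = 0.014360$$

the values used in the plan presented in Sec. 2.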
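The inversion of a symmetric two-rowed matrix is short enough to write out in full. The following Python sketch recomputes $a$, $b$, and $c$ from the variances and covariance quoted above; the function name is hypothetical.

```python
def inverse_2x2_symmetric(s11, s22, s12):
    """Return (a, b, c) such that [[a, b], [b, c]] inverts [[s11, s12], [s12, s22]]."""
    det = s11 * s22 - s12 ** 2
    return s22 / det, -s12 / det, s11 / det


S11, S22, S12 = 46.294995, 70.033392, -4.290599
a, b, c = inverse_2x2_symmetric(S11, S22, S12)
print(round(a, 6), round(b, 6), round(c, 6))

# Check: the product of the matrix and its inverse should be the unit matrix.
print(round(S11 * a + S12 * b, 9), round(S11 * b + S12 * c, 9))
print(round(S12 * a + S22 * b, 9), round(S12 * b + S22 * c, 9))
```

Because the covariance is negative, the off-diagonal element $b$ of the inverse is positive.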

5.3. Ratings of Equipment and Personnel by Test Bombing

A series of test bombings can be utilized to obtain information not
only regarding the sights used, but at the same time regarding other
equipment and the men who do the flying and drop the bombs. For each
flight $T_0^2$ supplies a measure of the general success, and this is
easily analyzed into the components $T_M^2$ and $T_D^2$, corresponding,
respectively, to deviations of the MPI from the target and of the bombs
from the MPI. By combining the values of $T_0^2$ (or of $T_M^2$ or of
$T_D^2$) for all the flights on which a particular kind of equipment was
used, or on which a particular man or team flew, and contrasting with the
flights not involving the same men and equipment, it is possible to obtain
ratings of personnel and equipment. Approximate probability distributions
are available for testing the reliability of such ratings, since each sum
of independent values of $T^2$ may with little error be regarded as having
the $\chi^2$ distribution, and the ratio of two such sums the $F$
distribution. Since several kinds of equipment in addition to the
personnel can thus be rated on the basis of the same bombing records, such
multipurpose test bombing is relatively very economical and productive of
information.

Complete validity of such ratings is, however, essentially dependent 
on a condition that must be provided before the bombings, not afterward. 
This condition is that, subject to whatever general restrictions are
appropriate, the assignment of personnel and equipment to particular
flights must be strictly at random. There is no way of doing this except
by the use of the actual mechanism of games of chance, or of their
equivalent in the form of tables of random numbers. In the absence of
such objective chance, the assignments must be made by human beings.
Experience has clearly shown that human beings unaided by gambling 
equipment are unable to assign different test materials to different 
experimental trials in such a way as to avoid bias, either in the averages 
to which the experiment leads or in the measures of error that must be 
computed as functions of deviations from the observed averages.
Consequently, the ratings of the pilots and bombardiers in Table 3.4, while
historically accurate, are subject to the possibility of being explained away 
by features of the method by which assignments were made. This same 
difficulty qualifies the statistical interpretation of economic time series 
and other such records which, though frequently the best available 
guides, lack the kind of validity possible only in a well-designed 
experiment. 

To secure efficiency as well as validity in an experiment, it is usually
necessary to impose restrictions on the randomization. Thus in
comparing the results of two treatments when the chance variances are the
same in both cases, the standard error of the difference of the means is
proportional to $\sqrt{1/N + 1/N'}$, where $N$ and $N'$ are the numbers to
which the two treatments are applied. If $N + N'$ has any fixed value,
the standard error is therefore a minimum when $N = N'$. Randomization
in assignment of the cases to the two treatments should then be
subject to the condition $N = N'$. This could be done by shuffling and
randomly dividing a pack of cards consisting of black and red cards
in equal numbers. It could not be done merely by flipping a coin
once for each case, since the condition $N = N'$ might not then be
satisfied. This question of efficient arrangement of experiments through
restricted randomization has many ramifications, particularly where several
questions, such as the efficacy of several kinds of equipment serving
different purposes, are to be tested simultaneously in the same
experiment.*
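The contrast between the shuffled pack and the coin can be illustrated with a short Python sketch. A seeded pseudo-random generator stands in here for the cards, coin, or table of random numbers; the seed, the total of 20 cases, and the treatment labels "A" and "B" are hypothetical choices made only for the illustration.

```python
import random
from math import sqrt


def se_factor(n, n_prime):
    """Factor sqrt(1/N + 1/N') to which the standard error is proportional."""
    return sqrt(1 / n + 1 / n_prime)


def card_assignment(n_cases, rng):
    """Shuffle a pack holding equal numbers of two kinds of card (n_cases even)."""
    half = n_cases // 2
    pack = ["A"] * half + ["B"] * half
    rng.shuffle(pack)
    return pack


def coin_assignment(n_cases, rng):
    """Flip a coin once per case; equal numbers are then not guaranteed."""
    return [rng.choice("AB") for _ in range(n_cases)]


TOTAL = 20                    # hypothetical number of cases
rng = random.Random(1947)     # arbitrary seed, for reproducibility only

# For fixed N + N', the factor is least at the equal split.
factors = {n: se_factor(n, TOTAL - n) for n in range(1, TOTAL)}
best_n = min(factors, key=factors.get)
print(best_n, round(factors[best_n], 4))

cards = card_assignment(TOTAL, rng)
coins = coin_assignment(TOTAL, rng)
print(cards.count("A"), cards.count("B"))   # always equal
print(coins.count("A"), coins.count("B"))   # equal only by chance
```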

Values of $T_0^2$, $T_M^2$, and $T_D^2$ were computed for each of the 65
flights. Graphs of these three functions in the order in which the flights
were made are appended as Figs. 3.8, 3.9, and 3.10, respectively. These
figures have the general appearance of control charts. They differ, 
however, from suitable control charts in certain respects. Measures of 
quality should be plotted on a control chart in the order in which the 
articles are manufactured, whereas here the order is that in which the 
tests were performed. In this material several different series of serial 
numbers were represented, referring to different modifications of design 
and perhaps different manufacturers. It is best to run a separate control 
chart for each modification and for each factory. 

* Fisher, Design of Experiments, is the best treatise on this subject.

Fig. 3.8. Over-all measure of error $T_0^2$ applied to sample data.

The points in these graphs are on the average higher relative to the
theoretical probability 0.20, 0.05, and 0.01 levels shown than would be
expected in a manufacturing and testing routine in a state of control.
The reason for this lies in a particular feature of the way the charts were
constructed rather than in any shortcomings of the sights tested. The
variances on which the coefficients in $T^2$ are based were chosen in Sec.
5.2 on the basis of deviations from 4-term regression functions, in order to
eliminate from these estimates of variance the effects, which it is to be
anticipated are not to continue long, of the differences between use and
nonuse of the Automatic Erection System and Stabilized Bombing
Approach Equipment, and between northward and southward flights.
This elimination has the effect of decreasing the variances somewhat and
thus of increasing the coefficients in each of the expressions for $T^2$.
If $a$, $b$, and $c$ had been calculated by inverting the two-rowed matrix
of sums of products of the raw deviations from the target, and if these raw
deviations had then been substituted in $T_0^2$, $T_M^2$, or $T_D^2$, the
average of the resulting values would for each of these three functions be
exactly the number of degrees of freedom, respectively, 8, 2, and 6. If, on
the other hand, with the values of $a$, $b$, and $c$ actually used, $r$ and
$d$ had been replaced by the deviations from the 4-term regression
functions, the averages would again in each case be exactly equal to the
number of degrees of freedom. But because actual deviations were
substituted for $r$ and $d$ while corrected variances were used in
determining the coefficients, the averages exceed the numbers of degrees of
freedom. Thus the average of the 65 values of $T_0^2$ is 9.00786 instead of
8. This is reflected graphically in the fact that the proportions of the 65
cases above the horizontal lines are in general somewhat greater than the
respective probabilities.

TABLE 3.3

Effect of Equipment on Over-all Accuracy of Bombing
(Excellent bombing is indicated by low values of $T_0^2$)

                            Number        $\sum T_0^2$    Average $T_0^2$
                            of flights

Flights using AES              24          228.350001         9.51458
Flights not using AES          41          357.161062         8.71125
Flights using SBAE              8           79.554375         9.94430
Flights not using SBAE         57          505.956688         8.87643

From Table 3.3 it appears that AES and SBAE, instead of improving
bombing accuracy, seemed to make it worse. The significance of this
apparent effect may be tested (with an approximation equivalent to
identifying the sample variances and covariance based on 256 degrees
of freedom with the true values of these parameters in an infinite
population) by means of the variance ratio distribution with numbers of
degrees of freedom equal to eight times (since each flight has 8 degrees of
freedom) the numbers of flights with and without the equipment in question.
For AES the ratio is

$$F = \frac{9.51458}{8.71125} = 1.09222$$

with $n_1 = 192$ and $n_2 = 328$. This is not at all significant since the
probability of a greater value of $F$ is considerably more than 0.20.
Actually, the relevant probability is more than double this value because
of our selection of the greater mean value of $T_0^2$ to put in the
numerator. For SBAE we have

$$F = \frac{9.94430}{8.87643} = 1.12030$$

which, with $n_1 = 64$ and $n_2 = 456$, is similarly insignificant. Thus
these results do not prove that AES and SBAE have deleterious effects. They
would not actually prove it in a really satisfactory manner even if the
results had turned out to be significant, since the assignment of the
equipment to particular flights was presumably not on the basis of
objective randomization. It is entirely possible that the decision to use
or not to use either of these kinds of equipment on each occasion had some
relation, conscious or unconscious, to the weather, the altitude, the
personnel, or some other variable affecting the accuracy.
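The two variance ratios and their degrees of freedom may be recomputed from the averages of Table 3.3 as in the following Python sketch. The dictionary layout and function name are illustrative; the probabilities themselves would still have to be taken from tables of the $F$ distribution, which the sketch does not attempt.

```python
DF_PER_FLIGHT = 8  # four bombs, each with a range and a deflection error

# (number of flights, average T_0^2) from Table 3.3
AES = {"with": (24, 9.51458), "without": (41, 8.71125)}
SBAE = {"with": (8, 9.94430), "without": (57, 8.87643)}


def variance_ratio(groups):
    """Return F with the greater mean in the numerator, and (n1, n2)."""
    by_mean = sorted(groups.values(), key=lambda item: item[1], reverse=True)
    (n_hi, mean_hi), (n_lo, mean_lo) = by_mean
    return mean_hi / mean_lo, n_hi * DF_PER_FLIGHT, n_lo * DF_PER_FLIGHT


f_aes, n1_aes, n2_aes = variance_ratio(AES)
f_sbae, n1_sbae, n2_sbae = variance_ratio(SBAE)
print(round(f_aes, 5), n1_aes, n2_aes)
print(round(f_sbae, 5), n1_sbae, n2_sbae)
```

In both comparisons the greater mean belongs to the flights using the equipment, so the numerator degrees of freedom are those of the smaller group of flights.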

The same qualifications apply to the average values of $T_0^2$ for the
10 teams shown in Table 3.4. Such scores are reliable indicators of
differences in personal ability only if one of two conditions is satisfied.
Either the assignment of the men must be suitably randomized, or an
exhaustive knowledge of the other causes of variation must be at hand
and must show that these other causes exactly balanced as between
the different teams. Since it is practically impossible to fulfill the
second condition, objective randomization in advance is the only practical
way of validating scores.


TABLE 3.4

Variation of Over-all Accuracy with Personnel
(Excellent bombing is indicated by low values of $T_0^2$)

Team           Number of flights    $\sum T_0^2$    Average $T_0^2$

 1  WT-H              17             184.834959        10.87264
 2  W-S               10              73.105782         7.31058
 3  R-R                7              62.168030         8.88115
 4  W-H                3              30.389824        10.12994
 5  R-H               11              72.692086         6.60837
 6  K-J                1               3.240875         3.24088
 7  K-H                8              98.558949        12.31987
 8  K-L                4              22.423547         5.60589
 9  W-R                3              28.637496         9.54583
10  K-R                1               9.459515         9.45952

All flights           65             585.511063         9.00786

Even under randomization, differences in scores prove nothing unless
they are too great to be ascribed to chance. As between any two teams
selected without reference to their scores in this bombing series, the
effects of chance may be examined by means of the variance ratio, equal
to the ratio of the mean values of $T_0^2$ with numbers of degrees of
freedom equal to the numbers of flights of the two teams multiplied by the
number of degrees of freedom, here 8, for each flight. However, it would be
an error to compare in this way two teams selected because their scores are
the greatest and the least or because they occupy adjacent places when
arranged in the order of the scores. Whether the 10 scores as a
whole differ significantly from one another might be determined by the
application of such a criterion as Bartlett's.* The tests might also be
refined, if it should seem worth while, by a more complete segregation
of the effects of personnel from those of each kind of equipment, here
represented by overlapping degrees of freedom, and of interactions.
We are dealing here with generalizations of the ordinary analysis of
variance, generalizations whose fundamental theory has been developed
only part of the way necessary for an efficient analysis of the
ramifications of bombing accuracy.
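The variance ratio for a pair of teams can be formed from Table 3.4 as in the Python sketch below. Teams 1 and 2 are used purely as a hypothetical pair, supposed to have been named before the scores were seen; as the text warns, the calculation would be misleading for a pair picked out because of its scores. The names `TEAMS`, `average`, and `variance_ratio` are illustrative.

```python
DF_PER_FLIGHT = 8  # degrees of freedom contributed by each flight

# team number: (number of flights, sum of T_0^2), from Table 3.4
TEAMS = {
    1: (17, 184.834959), 2: (10, 73.105782), 3: (7, 62.168030),
    4: (3, 30.389824), 5: (11, 72.692086), 6: (1, 3.240875),
    7: (8, 98.558949), 8: (4, 22.423547), 9: (3, 28.637496),
    10: (1, 9.459515),
}


def average(team):
    """Mean value of T_0^2 over the flights of one team."""
    flights, total = TEAMS[team]
    return total / flights


def variance_ratio(team_a, team_b):
    """Ratio of mean T_0^2 (greater over lesser) and its degrees of freedom."""
    hi, lo = sorted((team_a, team_b), key=average, reverse=True)
    return (average(hi) / average(lo),
            TEAMS[hi][0] * DF_PER_FLIGHT,
            TEAMS[lo][0] * DF_PER_FLIGHT)


all_flights = sum(flights for flights, _ in TEAMS.values())
all_total = sum(total for _, total in TEAMS.values())
print(all_flights, round(all_total / all_flights, 5))

f, n1, n2 = variance_ratio(1, 2)   # hypothetical pair chosen in advance
print(round(f, 4), n1, n2)
```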
CHAPTER 4

EFFECTS OF ROUNDING OR GROUPING DATA 

1. INTRODUCTION 

A statistician must of necessity work with rounded numbers, at
least to the extent that his data are recorded values of continuous
variables, that is, measurements. Enumeration data may be rounded
also, to tens, hundreds, thousands, or millions, for convenience in
handling. The more coarsely the data are rounded, the less accurately do
they represent the facts observed, and very coarse rounding may render
them useless as a basis for conclusions about the phenomena concerned.
This is self-evident; but it is not always appreciated that rounding
considerably less coarse than that which renders the data useless may,
nevertheless, greatly impair or even invalidate the analysis of the data.

William Sealy Gosset, better known as “Student,” has given a most 
penetrating discussion of the effects of coarse rounding or grouping of 
data on their statistical properties.* Referring to certain data, relating 
to height and left middle finger measurements of 3,000 criminals, which 
he had used for an experimental test of his theoretical results, he wrote: 

Another thing which interferes with the comparison is the comparatively 
large groups in which the observations occur. The heights are arranged in 1-inch 
groups, the standard deviation being only 2.54 inches, while the finger lengths 
were originally grouped in millimeters, but unfortunately I did not at the time 
see the importance of having a smaller unit and condensed them into
2-millimeter groups, in terms of which the standard deviation is 2.74.

Several curious results follow from taking samples of 4 from material disposed 
in such wide groups. The following points may be noticed: 

(1) The means only occur as multiples of 0.25. 

(2) The standard deviations† occur as the square roots of the following
types of numbers: $n$, $n + 0.19$, $n + 0.25$, $n + 0.50$, $n + 0.69$,
$2n + 0.75$.

(3) A standard deviation belonging to one of these groups can only be
associated with a mean of a particular kind; thus a standard deviation of
$\sqrt{2}$ can only occur if the mean differs by a whole number from the
group we take as origin, while $\sqrt{1.69}$ will only occur when the mean
is at $n \pm 0.25$.

(4) All the four individuals of the sample will occasionally come from the
same group, giving a zero value for the standard deviation. Now this leads to
an infinite value of $z$‡ and is clearly due to too wide a grouping, for
although two men may have the same height when measured by inches, yet the
finer the measurements the more seldom will they be identical, till finally
the chance that four men will have exactly the same height is infinitely
small. If we had smaller grouping the zero values of the standard deviation
might be expected to increase, and a similar consideration will show that
the smaller values of the standard deviation would also be likely to
increase, such as 0.436, when 3 fall in one group and 1 in an adjacent
group, or 0.50 when 2 fall in two adjacent groups. On the other hand, when
the individuals of the sample lie far apart, the argument of Sheppard's
correction will apply, the real value of the standard deviation being
more likely to be smaller than that found owing to the frequency in any group
being greater on the side nearer the mode.

These two effects of grouping will tend to neutralize each other in their effect
on the mean value of the standard deviation but both will increase the variability.

Accordingly, we find that the mean value of the standard deviation is quite
close to that calculated, while in each case the variability is sensibly greater.
The fit of the curve is not good, both for this reason and because the frequency is
not evenly distributed owing to effects (2) and (3) of grouping. On the other
hand, the fit of the curve giving the frequency of $z$ is very good, and as that is the
only practical point the comparison may be considered satisfactory.

* Student, “The Probable Error of a Mean,” Sec. IV. (For full citations, see
Sec. 7.)

† Here Student uses the term “standard deviation” to refer to the root mean
square of the deviations of the four observations in a sample from their mean.

‡ Here Student uses $z$ to refer to the deviation of the sample mean from the
population mean divided by the standard deviation of the sample, the latter
defined as above.
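Student's points (1) to (3) can be confirmed by enumerating every sample of 4 drawn from data recorded in whole groups. The Python sketch below does this with exact fractions, taking the standard deviation in Student's sense (root mean square deviation, divisor 4). The choice of six adjacent groups numbered 0 to 5 is an assumption made to keep the enumeration small, and the helper names are illustrative.

```python
from fractions import Fraction
from itertools import combinations_with_replacement


def sample_stats(sample):
    """Mean and mean-square deviation (divisor n) of a sample, as exact fractions."""
    n = len(sample)
    mean = Fraction(sum(sample), n)
    variance = Fraction(sum(x * x for x in sample), n) - mean ** 2
    return mean, variance


def fractional_part(q):
    """Fractional part of a nonnegative Fraction."""
    return q - q.numerator // q.denominator


GROUPS = range(6)  # assumed: six adjacent groups, in units of the group width
stats = [sample_stats(s) for s in combinations_with_replacement(GROUPS, 4)]

# Which fractional parts of the squared standard deviation go with each
# fractional part of the mean?
pairing = {}
for mean, variance in stats:
    pairing.setdefault(fractional_part(mean), set()).add(fractional_part(variance))

for mean_part in sorted(pairing):
    print(float(mean_part), sorted(float(v) for v in pairing[mean_part]))
```

The fractional parts 3/16 and 11/16 found for the squared standard deviation are the values printed to two decimals as 0.19 and 0.69; they arise only when the mean lies a quarter of a group from a whole number, whereas whole-number means admit only the fractional parts 0 and 0.50.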

The Statistical Research Group received various sets of data that 
exhibited curious statistical properties. In some instances it was found 
that these properties were attributable to the coarse rounding employed 
in recording the data. In at least one instance the peculiarities were 
attributable to the fact that a fraction of the data had been recorded with 
half the precision of the remainder, owing (in part) to intrinsic differences 
in the precisions of the two machines used to obtain the measurements. 
Occasionally the Group was asked to make recommendations with 
regard to the recording of data, since its investigations had shown that 
in many cases the trouble arose in connection with recording the data 
and not in the actual reading of the scales. As a consequence of
difficulties encountered in analyzing coarsely rounded data and of requests for
advice on rounding in general, a study of rounding was undertaken, 
principally from the viewpoint of its effect on the standard procedures 
of statistical analysis. The results of this study are summarized in this 
chapter. Much of the material presented here can be found in the
literature of statistics, but it is widely scattered.

2. ROUNDED NUMBERS AND SCALE READINGS 

Rounded numbers enter the practical man's world through two different
doorways. First, the measurements of a scientist or engineer are
rounded numbers, since all measuring equipment is of limited accuracy.
Second, data are often recorded to less accuracy than is attainable with
the available equipment, the accuracy preserved being more easily
attained and “adequate for the purposes at hand”; or they are rounded
in computing, sometimes to simplify arithmetical calculations, as when
0.40 is used in place of 51/128 (0.3984375 is exact), and sometimes
because it cannot be avoided, as when 3.1416 is used in place of $\pi$.
In the case of the computer’s reduction of exact numbers to rounded
numbers in decimal form, assuming that he has rounded them correctly
in accordance with the rule given in the next section, it can be stated
that each is accurate within ±1/2 unit in the last decimal place retained. In
the case of the scientist’s or the engineer’s measurements this may not be
true, and yet the last figures retained may contain worth-while information,
in the sense that when the last figures are retained the data are
more accurate than when the last figures are omitted entirely or are used
as a basis for rounding the results to one less place. In such cases, if
the measurements are all of comparable accuracy, there will exist a
“unit” of measurement with respect to which the uncertainty of a
given measurement is ±1/2 unit.

Consider, for example, the case of an experimenter taking “readings”
on a scale with graduations every 1/8 in. He can read his determination
to the nearest 1/8 in. directly and may then visually estimate it to the
nearest 1/16 in., obtaining, say, a reading of 3 7/16 in. If he writes this out
in decimal form it becomes 3.4375 in., and represents some number
between 3.40625 in. (3 13/32 in.) and 3.46875 in. (3 15/32 in.), and thus is
accurate only to within ±0.03125 in. and not to within 0.00005 in., as the
number of decimal places retained might suggest. He may decide,
therefore, to report the value as 3.44, which is accurate only to within
+0.02875 and -0.03375, but which will be closer to the true value than
is 3.4, unless the true value lies between 3.40625 and 3.42000. When so
reported, however, his readings will not give a clear indication of their 
accuracy. If an unambiguous indication of their accuracy is important, 
he should report his readings in units of % in., in which event they will be 
accurate to within plus or minus unit. 

3. THE ROUNDING RULE AND ITS PROPERTIES 

When a number expressed in the decimal system is to be rounded
to a specific number of significant figures,† the rounding should always be
carried out in accordance with the following rule:

* The mathematician has devised convenient symbols for certain very practical
quantities, such as the diagonal of a square of unit side ($\sqrt{2}$) and the ratio of the
circumference of a circle to its diameter ($\pi$), which cannot be expressed exactly as
common fractions or as decimal numbers with a finite number of decimals. By
using the mathematician’s symbols for these quantities, the practical man can usually
keep his algebraic manipulations simple, but sooner or later there comes a point at
which a numerical answer is needed, and here it becomes necessary to replace $\pi$
(3.14159 26535 89793 23846+) with a common fraction, such as 355/113 (which is
correct to 6 decimal places), or with a decimal approximation such as 3.141593.

† A significant figure of a number (expressed in the decimal system of notation)

Rounding Rule:

a. If the first digit to be dropped is less than 5, the last digit retained
shall be left unchanged.

b. If the first digit to be dropped is greater than 5, or is a 5 followed
by digits greater than 0, the last digit retained shall be increased by 1.

c. If the first digit to be dropped is a 5 alone, or a 5 followed by 0’s 
only, the last digit retained shall be rounded to the nearest even number. 
Thus, rounding to 2 decimal places, 

    $\pi$      = 3.141593   yields 3.14
    e        = 2.718282   yields 2.72
    1/22     = 0.045455   yields 0.05
    1/200    = 0.005000   yields 0.00
    1/198    = 0.005051   yields 0.01
    10/2,247 = 0.004450   yields 0.00
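
The six examples above can be checked mechanically. The following is a minimal sketch in Python, assuming that the numbers are supplied as decimal strings (so that no binary floating-point error enters) and that the standard `decimal` module's `ROUND_HALF_EVEN` mode is an acceptable stand-in for parts (a), (b), and (c) of the rule; the function name `round_rule` is introduced here for illustration only.

```python
from decimal import Decimal, ROUND_HALF_EVEN

def round_rule(value, places):
    """Round a decimal string (or Decimal) to `places` decimals.

    ROUND_HALF_EVEN leaves the last digit unchanged below 5, raises it
    above 5, and sends an exact 5 to the nearest even digit.
    """
    quantum = Decimal(1).scaleb(-places)      # places=2 gives Decimal("0.01")
    return Decimal(value).quantize(quantum, rounding=ROUND_HALF_EVEN)

examples = ["3.141593", "2.718282", "0.045455",
            "0.005000", "0.005051", "0.004450"]
for text in examples:
    print(text, "yields", round_rule(text, 2))
```

Working from strings rather than floats is a deliberate choice in this sketch: a value such as 0.005 has no exact binary representation, so the "5 alone" case of part (c) would otherwise be decided by representation error rather than by the rule.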

It is clear that to every number that is to be “rounded down” in 
accordance with part (a) of the rule there corresponds a number that is 
to be “rounded up” in accordance with part (b). Parts (a) and (b) thus
establish a balance between rounding down and rounding up for all 
numbers other than those that are exactly 5 in the first place to be 
dropped. Part (c) establishes a similar balance between rounding down 
and rounding up for numbers that are exactly 5 in the first place to be 
dropped, by specifying that the rounding shall be up when the preceding 
digit is 1, 3, 5, 7, and 9; down, when it is 0, 2, 4, 6, 8. Part (c) is arbitrary, 
since rounding to the nearest odd number would establish a balance in 
these cases equally well, being equivalent to merely interchanging the 
words up and down in the preceding sentence. However, uniformity of 
procedure is important for checking, and consistency of published tables is 
desirable, so part (c), being well established, should be adhered to. 

A question that arises naturally at this point is: Will repeated applications
of the rule (dropping digits singly, or in pairs, or otherwise) always
lead to the same result, when the rounding is stopped at a specified place,
as a single rounding to that place? The answer, unfortunately, is No.
The fifth example in the list above is a case in point. If 0.005051 is
rounded to 2 decimal places in accordance with the rule in almost any
manner involving more than one step, the result is 0.00, which is “incorrect.”
The third example may be used to illustrate the fact that the
number of places to which a number is originally tabulated may affect
the outcome of subsequent rounding: 1/22 = 0.0 45 45 45 45 ad infinitum;

is any digit that contributes to the specification of the magnitude of the number aside 
from serving to locate the decimal point. For example, 345 and 0.000345 each have 
three significant figures; 34.05 and 34.50 (as distinct from 34.5) each have four 
significant figures; 34500 has 3, 4, or 5 according as it means $3.45 \times 10^4$, $3.450 \times 10^4$,
or $3.4500 \times 10^4$, respectively.
correct to 6 decimal places it equals 0.045 455, the value given in the
table above; if this latter value is rounded to 5 decimals in accordance
with the rule, the result is 0.045 46; however, 1/22 equals 0.045 45 correct to
5 decimals, the result that will be obtained by rounding to 5 decimals
when the value originally tabulated is correct to 7 decimals or more.
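
The two cases just described can be reproduced with a short sketch. It assumes the same string-based use of Python's `decimal` module as a stand-in for the Rounding Rule; the helper names `round_rule` and `round_in_steps` are illustrative and not part of the original text.

```python
from decimal import Decimal, ROUND_HALF_EVEN

def round_rule(value, places):
    """Single application of the Rounding Rule to `places` decimals."""
    quantum = Decimal(1).scaleb(-places)
    return Decimal(value).quantize(quantum, rounding=ROUND_HALF_EVEN)

def round_in_steps(value, places_sequence):
    """Apply the rule repeatedly, once for each entry of places_sequence."""
    result = Decimal(value)
    for places in places_sequence:
        result = round_rule(result, places)
    return result

single = round_rule("0.005051", 2)                    # one step
stepwise = round_in_steps("0.005051", [5, 4, 3, 2])   # one digit at a time
from_six = round_rule("0.045455", 5)      # 1/22 as tabulated to 6 places
from_seven = round_rule("0.0454545", 5)   # 1/22 as tabulated to 7 places

print("single step :", single)
print("digit by digit:", stepwise)
print("1/22 from 6 places:", from_six, " from 7 places:", from_seven)
```

The stepwise result differs from the single-step result because an intermediate rounding manufactures a "5 alone" that the exact number never contained.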

It is clear from the foregoing that, when an n-place table is prepared by
rounding the entries given in an (n + k)-place table, some of the entries
may not be correct in the nth place. Thus, in a table of $\log_e N$ to 5
decimal places, the 5-place value given for $\log_e 1947$ = 7.57404 50054
will be incorrect in the fifth place if obtained by rounding from less than 
an 8-place table with entries correct in the eighth place. Final 5’s can, 
however, be marked to show whether further rounding should be up 
or down; see, for example, Tables 1.1 and 1.5 of Chap. 1, in which each 
final 5 is italicized if rounding should not be in accordance with part (c) 
of the rule. 
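
The statement about $\log_e 1947$ can be verified by rounding the 10-place value quoted above first to k places and then to 5. This is only a sketch: it takes the digits 7.57404 50054 from the text as given and again uses Python's `decimal` module, with an illustrative helper `round_rule`, in place of hand rounding.

```python
from decimal import Decimal, ROUND_HALF_EVEN

def round_rule(value, places):
    quantum = Decimal(1).scaleb(-places)
    return Decimal(value).quantize(quantum, rounding=ROUND_HALF_EVEN)

LOG_1947 = "7.5740450054"        # value quoted in the text

direct = round_rule(LOG_1947, 5)
results = {}
for k in (6, 7, 8, 9):
    tabulated = round_rule(LOG_1947, k)          # entry of a k-place table
    results[k] = round_rule(tabulated, 5)        # 5-place value derived from it
    print(k, tabulated, results[k], results[k] == direct)
```

Under these assumptions the 6- and 7-place tables lead to a fifth place that disagrees with direct rounding, and the 8- and 9-place tables agree with it.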

It may be asked whether there exists a rounding rule that will always
lead to the same result at a specified place, irrespective of the number of
applications of the rule employed in rounding to that place. A procedure
suggested by Cayley* possesses this advantage. According to this
procedure, the figure in the last place to be retained is not adjusted at
all (which is equivalent to rounding down irrespective of the digits that
follow), with the result that the number recorded either is exact or errs
in defect. If, when a recorded number of this type is used, a final 5 is
affixed to it, then the error will lie between plus and minus 1/2 unit in the
last place recorded, as is the case when the number recorded is rounded
in a single step from its exact value in accordance with the Rounding
Rule. For example, $\pi$ would be recorded to 4 decimals as 3.1415, and
would be taken to be 3.14155 in further calculations. Whereas Cayley’s
procedure has the advantages that it is simple (since numbers are always
rounded down), that it can be mechanized for purposes of machine
computation, and that it yields an n-place recorded number that is
independent of the number of rounding steps involved in reaching it from
the exact value, it has the disadvantage that it requires the use of (n + 1)-place
numbers in further computations in order to attain the same
degree of accuracy that would be obtained by using n-place numbers
rounded in accordance with the Rounding Rule.
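
Cayley's procedure lends itself to a brief illustration. The sketch below assumes positive numbers supplied as decimal strings and uses `ROUND_DOWN` (truncation) from Python's `decimal` module; the names `cayley_record` and `cayley_use` are hypothetical labels for the two halves of the procedure.

```python
from decimal import Decimal, ROUND_DOWN

def cayley_record(value, places):
    """Record a positive number by dropping digits, with no adjustment."""
    quantum = Decimal(1).scaleb(-places)
    return Decimal(value).quantize(quantum, rounding=ROUND_DOWN)

def cayley_use(recorded):
    """Value taken in further calculation: the record with a final 5 affixed."""
    return Decimal(str(recorded) + "5")

PI = "3.14159265358979"
recorded = cayley_record(PI, 4)                       # recorded value
used = cayley_use(recorded)                           # value used in computing
via_steps = cayley_record(cayley_record(PI, 8), 4)    # two truncations

print(recorded, used, via_steps)
print("error of value used:", used - Decimal(PI))
```

The record is the same however many truncation steps are taken, and the value with the affixed 5 lies within half a unit of the fourth decimal place of the exact value, at the cost of carrying one more place.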

4. EFFECTS OF ROUNDING ON STATISTICAL ANALYSES 
OF LARGE NUMBERS OF OBSERVATIONS 

4.1. Nature of the Effects 

Frequently a statistician analyzes measurements recorded to several 
decimal places more than can possibly have any practical significance. 

* Cayley, letter to J. W. L. Glaisher. 
The extra decimal places do not occasion any great concern, for nothing
has been done that cannot be undone. Recording the extra digits and
rounding the measurements are merely unnecessary nuisances. On the
other hand, measurements are sometimes recorded to the nearest 5, 10,
20, 50, or 100 units, or still more coarsely, when they could, and should,
have been recorded to the nearest unit, or even more accurately. Thus, if
duplicate measurements taken under essentially the same conditions tend
to differ on the average by about 50 units, research workers often record
the measurements to the nearest 10 or 20 units; sometimes only to the
nearest 50 units. If measurements taken under essentially the same
conditions tend to be distributed approximately normally about a
“true value” with a standard deviation of $\sigma$, these roundings correspond
to intervals with widths roughly $0.23\sigma$, $0.45\sigma$, and $1.1\sigma$, respectively.*
Statisticians, on the other hand, recommend that the width of the rounding
interval be less than one-third, or, better, less than one-fourth the
standard deviation of random sampling.†
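
The conversion from an average difference between duplicates to a rounding width in standard deviation units is simple arithmetic, sketched below. It assumes normally distributed measurements, for which the mean absolute difference of two independent values is $2\sigma/\sqrt{\pi}$; the function name and the widths tried are illustrative.

```python
import math

def interval_width_in_sigma(width, mean_duplicate_difference):
    """Express a rounding width in units of sigma.

    Assumes normal errors, so that the mean absolute difference between
    duplicate measurements equals 2 * sigma / sqrt(pi).
    """
    sigma = mean_duplicate_difference * math.sqrt(math.pi) / 2.0
    return width / sigma

for width in (10, 20, 50):
    ratio = interval_width_in_sigma(width, mean_duplicate_difference=50)
    verdict = "finer" if ratio < 1 / 3 else "coarser"
    print(f"nearest {width:3d} units: {ratio:.3f} sigma ({verdict} than sigma/3)")
```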

Basically, the choice of the size of a rounding or grouping interval
depends on the answers to the following three questions:

a. If the observations $x_1, x_2, \ldots, x_N$ are to be regarded as independent
values of a random variable x, which, in the population from
which the data are drawn, is distributed about a mean of $\mu$ with a standard
deviation $\sigma$, then what is the effect of the rounding (grouping) on the
sample estimates of $\mu$ and $\sigma$, defined by

$$\bar{x} = \frac{1}{N} \sum_{i=1}^{N} x_i \qquad (1)$$

$$s^2 = \frac{1}{N - 1} \sum_{i=1}^{N} (x_i - \bar{x})^2 \qquad (2)$$

b. What is the effect of the rounding on confidence limits for $\mu$ and $\sigma$
based on $\bar{x}$ and s?

c. What is the effect of the rounding on statistical tests of hypotheses
about $\mu$ and $\sigma$ based on $\bar{x}$ and s?

In the present section, answers to these questions will be given for 
the case when the number of observations N is large. In Sec. 5 the case 
where N is small, perhaps only 2 or 3, is discussed; the results given there 

* The average difference between duplicate measurements is, under the assumptions
stated, $2\sigma/\sqrt{\pi} = 1.12838\sigma$. If this average difference is 50, a grouping interval
of 10 corresponds with $1.12838\sigma/5 = 0.226\sigma$, and similarly for a grouping interval
of 20.

† See, for example, Fisher, Statistical Methods, Sec. 13.

throw light on what values of N may be regarded as large and what values
must be regarded as small.

4.2. Sheppard’s Corrections 

In order to answer the first question, consider the relation between
the kth (population) moment

$$\mu'_k = E(x^k) = \int_{-\infty}^{\infty} x^k p(x)\,dx \qquad (k = 1, 2, 3, 4, \ldots) \qquad (3)$$

of a continuous random variable distributed according to the probability
density function* p(x), and the kth moment

$$\nu'_k = \sum_{i=-\infty}^{\infty} x_i^k \int_{x_i - w/2}^{x_i + w/2} p(t)\,dt \qquad (k = 1, 2, 3, 4, \ldots) \qquad (4)$$

of the histogram formed by grouping the values of x into classes of width
w with mid-points $x_i$ ($i = 0, \pm 1, \pm 2, \ldots$). The $\mu'_k$ are, by definition,
the moments (about the origin) of x in the population, in which x is a
continuous variable. On the other hand, since it is not possible either
to read or to record with perfect accuracy all values of a continuous
variable, recorded values of x will generally be rounded values, any value
of x between $x_i - (w/2)$ and $x_i + (w/2)$ being recorded as $x_i$. Hence, in
a sense, the $\nu'_k$ are realistic moments of x and the $\mu'_k$ are ideal moments.

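If, first, p(x) can be expanded in a Taylor series throughout the range
for which p(x) > 0, second, $x^k p^{(2n)}(x)$ is finite and continuous throughout
this range ($k = 1, 2, 3, 4, \ldots$; $n = 1, 2, 3, 4, \ldots$), and third, p(x)
and $p^{(n)}(x)$ ($n = 1, 2, \ldots$) vanish at the limits of this range, then
the $\nu'_k$ and the $\mu'_k$ are related as follows:

$$\nu'_1 = \mu'_1$$
$$\nu'_2 = \mu'_2 + \frac{w^2}{12}$$
$$\nu'_3 = \mu'_3 + \frac{w^2}{4}\,\mu'_1 \qquad (5)$$
$$\nu'_4 = \mu'_4 + \frac{w^2}{2}\,\mu'_2 + \frac{w^4}{80}$$
$$\nu'_k = \sum_{j=0}^{\lfloor k/2 \rfloor} C^k_{2j} \left(\frac{w}{2}\right)^{2j} \frac{1}{2j + 1}\,\mu'_{k-2j}$$

where $\mu'_0 = 1$, $C^k_r = k!/[r!(k - r)!]$ for $0 \le r \le k$, and $C^k_r = 0$ for $r > k$.†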

* That is, for any value of c, the integral $P(c) = \int_{-\infty}^{c} p(x)\,dx$ equals the probability
that x is less than c, $0 \le P(c) \le 1$ for all values of c, $P(-\infty) = 0$, and $P(+\infty) = 1$.

† Sheppard, “Most Probable Values of Frequency Constants.” The corresponding
case of two variables has been considered by Baten, “Corrections for the
Moments,” and of several variables by Wold, “Sheppard’s Correction Formulae.”





194 


TECHNIQUES OF STATISTICAL ANALYSIS 


Chap. 4 


The three conditions regarding p(x) imply that it is not only continuous
over the range for which p(x) > 0 but tapers off strongly to zero
at the ends of this range; they thus exclude probability curves with
abrupt ends, such as a uniform distribution over $-w/2$ to $+w/2$, and
U-shaped or J-shaped curves. The three conditions are necessary and
sufficient for the relations (5) to hold exactly. Relations analogous to
(5), with additional terms, have been obtained for cases excluded by the
above assumptions.*

The relations (5) can be derived in a somewhat more general manner
that shows that they are valid on the average whenever the moments
involved exist, provided that the rounding lattice is imposed at random
on the underlying distribution.† Suppose the x axis is first divided in a
definite manner into an infinite number of class intervals of width w/m,
where m is some integer greater than 1. Given a probability density
function p(x), the histogram corresponding to this grouping of the values
of x can always be determined from

$$P(x_j) = \int_{x_j - w/(2m)}^{x_j + w/(2m)} p(t)\,dt \qquad (j = 0, \pm 1, \pm 2, \ldots) \qquad (6)$$

where $x_j$ is the mid-point of the jth subdivision, since the integrals
involved will always exist when p(x) is a probability density function.
The moments about the origin of this basic histogram, $\mu''_k$, say, can be
determined from the summation analogous to (4), so far as they exist
(that is, are finite). Corresponding to the basic histogram determined
by (6), m new histograms can be formed by combining the successive
subclasses of width w/m into subclasses of width w, the intervals containing
$x_0, x_1, \ldots, x_{m-1}$ being combined in one instance, the intervals
containing $x_1, x_2, \ldots, x_m$ being combined in the next instance, and
so forth. Corresponding to each of these histograms, moments about the
origin, $\nu'_k(s)$, $s = 1, 2, \ldots, m$, can be determined from summations
analogous to (4), so far as they exist. Let $\bar{\nu}'_k$ denote the average of the
values of $\nu'_k(s)$, $s = 1, 2, \ldots, m$; then the $\bar{\nu}'_k$ and the $\mu''_k$ are related as
follows, in so far as they exist:



* See papers by Elderton, Pairman and Pearson, Pearse, and Pearson listed in 
Sec. 7 below. 

† See the papers by Abernathy, Craig, and Carver listed in Sec. 7.
and so forth. Letting $m \to \infty$, the right-hand sides of these equations
tend to the corresponding expressions in equations (5), since the moments
of the basic histogram tend to the $\mu'_k$ as $m \to \infty$. On the other hand,
the $\bar{\nu}'_k$ retain their meanings as average values of the $\nu'_k(s)$.
Accordingly, whatever the nature of the distribution
of x, equations (5) hold on the average, that is, they give the average
relationships between the $\nu'_k$ and the $\mu'_k$, provided these are finite, for an
infinite series of trials in which a rounding lattice composed of sections of
width w is imposed at random on the underlying probability distribution.

The results given above all refer to moments about a fixed point
taken as the origin, and the values of the moments involved will depend
upon the choice of the origin. Thus, if x denotes temperature and is
measured in degrees Fahrenheit, the values of the $\mu'_k$ and $\nu'_k$ will be different
from those obtained when x is measured in degrees centigrade. The
results obtained will be the same, however, if the origin is chosen at
the mean of the distribution of x. Dropping the primes to signify
moments about the mean of x, relations (5) correspond with

$$\nu_1 = \mu_1 = 0$$
$$\nu_2 = \mu_2 + \frac{w^2}{12} \qquad (8)$$
$$\nu_3 = \mu_3$$
$$\nu_4 = \mu_4 + \frac{w^2}{2}\,\mu_2 + \frac{w^4}{80}$$

and so forth.

Comparison of the second equation of (8) with (22) shows that the
variance (second moment about the mean) of a rounded value of x, which
may be denoted by $x_R$ for convenience, equals the variance of x, namely $\mu_2$,
plus a variance $w^2/12$, which equals the variance of a rounding error $\epsilon$,
which is distributed uniformly between $-w/2$ and $+w/2$. This suggests
that we may write

$$x_R = x + \epsilon \qquad (9)$$

where $x_R$ denotes a value of x rounded to within $\pm w/2$, and $\epsilon$ is a random
variable independently distributed with uniform probability density
between $-w/2$ and $+w/2$. That the representation (9) is strictly valid
under the three conditions stated above, and that it is valid on the average,
in the above sense, under any conditions for which the moments exist,
has been demonstrated rigorously.*
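
The second and fourth relations of (8) can be checked numerically for a normal variable, which satisfies Sheppard's three conditions very closely. The sketch below is an illustration under stated assumptions, not a proof: it takes a standard normal variable ($\mu_2 = 1$, $\mu_4 = 3$), places one class mid-point at the mean, and sums the class probabilities exactly by means of `math.erf`. The function names are introduced here for the purpose.

```python
import math

def normal_cdf(z):
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))

def grouped_central_moments(w, offset=0.0, span=12.0):
    """Second and fourth moments about the mean of a standard normal
    variable recorded at class mid-points offset + i*w (class width w)."""
    n = int(span / w) + 2
    mids, probs = [], []
    for i in range(-n, n + 1):
        mid = offset + i * w
        mids.append(mid)
        probs.append(normal_cdf(mid + w / 2) - normal_cdf(mid - w / 2))
    mean = sum(m * p for m, p in zip(mids, probs))
    v2 = sum((m - mean) ** 2 * p for m, p in zip(mids, probs))
    v4 = sum((m - mean) ** 4 * p for m, p in zip(mids, probs))
    return v2, v4

w = 1.0                                   # rounding width of one sigma
v2, v4 = grouped_central_moments(w)
print("v2 =", v2, " mu2 + w^2/12 =", 1 + w**2 / 12)
print("v4 =", v4, " mu4 + (w^2/2) mu2 + w^4/80 =", 3 + w**2 / 2 + w**4 / 80)
```

Even with a rounding width as coarse as one standard deviation, the grouped moments computed this way should agree with the right-hand sides of (8) to several decimal places.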

Since it is usually desired to determine the $\mu_k$ from the $\nu_k$, equations
(8) are customarily solved for the $\mu_k$ in terms of the $\nu_k$ and powers of w,
and the relations given as

* See papers by Craig and by Langdon and Ore, listed in Sec. 7.

$$\mu_1 = \nu_1 = 0$$
$$\mu_2 = \nu_2 - \frac{w^2}{12} \qquad (10)$$
$$\mu_3 = \nu_3$$
$$\mu_4 = \nu_4 - \frac{w^2}{2}\,\nu_2 + \frac{7w^4}{240}$$

and so forth.*
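
Relations (8) and (10) are inverse to one another, and this can be confirmed by a round trip. The following sketch encodes both sets of relations as plain functions; the function names and the numerical case (a normal variable with $\sigma = 2$, so that $\mu_2 = 4$, $\mu_3 = 0$, $\mu_4 = 48$, rounded with $w = 1$) are chosen here for illustration only.

```python
def grouped_from_true(m2, m3, m4, w):
    """Relations (8): grouped central moments expected from the true ones."""
    v2 = m2 + w**2 / 12
    v3 = m3
    v4 = m4 + (w**2 / 2) * m2 + w**4 / 80
    return v2, v3, v4

def sheppard_corrections(v2, v3, v4, w):
    """Relations (10): adjust grouped central moments for grouping."""
    m2 = v2 - w**2 / 12
    m3 = v3
    m4 = v4 - (w**2 / 2) * v2 + 7 * w**4 / 240
    return m2, m3, m4

w = 1.0
v2, v3, v4 = grouped_from_true(4.0, 0.0, 48.0, w)
m2, m3, m4 = sheppard_corrections(v2, v3, v4, w)
print("grouped :", v2, v3, v4)
print("adjusted:", m2, m3, m4)
```

The coefficient 7/240 in the fourth relation arises from substituting $\mu_2 = \nu_2 - w^2/12$ into the fourth relation of (8): $w^4/24 - w^4/80 = 7w^4/240$.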

These adjustments of the $\nu_k$ to obtain the $\mu_k$ are customarily called
Sheppard’s corrections for grouping. As has been indicated above, they
are valid as average adjustments whenever the moments involved exist,
provided that the rounding lattice is imposed at random on the underlying
distribution. This latter provision means, for example, that when
values of a variable that can take only nonnegative values are to be
rounded, zero shall not inevitably be adopted as the lower extremity of
the lowest class; indeed, the mid-point of the lowest class may turn
out to be a small negative number. If the rounding lattice is not imposed
at random when Sheppard’s original assumptions are not fulfilled,
then the foregoing adjustments may be wildly wrong and may even
adjust in the wrong direction.†
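
A concrete case of an adjustment in the wrong direction may be helpful. The sketch below is an assumed example, not one taken from the text: a J-shaped (exponential) variable with unit mean and unit variance is grouped into classes of width w whose lowest class begins exactly at zero, and each value is recorded at its class mid-point. The function name is hypothetical.

```python
import math

def grouped_variance_exponential(w, classes=2000):
    """Variance of a unit-mean exponential variable grouped into classes
    [0, w), [w, 2w), ... and recorded at the class mid-points."""
    mids = [(i + 0.5) * w for i in range(classes)]
    probs = [math.exp(-i * w) - math.exp(-(i + 1) * w) for i in range(classes)]
    mean = sum(m * p for m, p in zip(mids, probs))
    return sum((m - mean) ** 2 * p for m, p in zip(mids, probs))

w = 1.0
true_variance = 1.0
grouped = grouped_variance_exponential(w)
adjusted = grouped - w**2 / 12            # Sheppard's adjustment, relation (10)

print("true variance    :", true_variance)
print("grouped variance :", grouped)
print("after adjustment :", adjusted)
```

Under these assumptions the grouped variance already falls short of the true variance, so that subtracting $w^2/12$ moves the estimate further from the truth rather than closer to it.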


4.3. Comparison of the Distribution of the Mean of N Rounded Observations
and the Distribution of the Mean of N Unrounded Observations,
When N Is Large

Sheppard’s corrections make possible a concise comparison of the
distribution of the mean of N rounded observations and the distribution
of the mean of N unrounded observations, when N is large. Continuing
the notation adopted in connection with (9), let

$$\bar{x}_R = \frac{1}{N} \sum_{i=1}^{N} x_{Ri} \qquad (11)$$

where $x_{Ri}$ is the rounded value of $x_i$, the ith of N independent values of a
random variable x, corresponding to a rounding lattice with classes of

* The corresponding relations for moments of higher order are given in Rietz, Handbook, p. 94.
They can be obtained by letting $m \to \infty$ in the corresponding expressions given by Craig.
Letting $\lambda_k$ and $L_k$ ($k = 1, 2, \ldots$) denote the semi-invariants of the unrounded
variable x and the rounded variable $x_R$, respectively, the corresponding relations
become simply

$$\lambda_{2m} = L_{2m} + (-1)^m B_m \frac{w^{2m}}{2m}$$
$$\lambda_{2m+1} = L_{2m+1}$$

where $B_m$ is the mth Bernoulli number. Thus, $\lambda_4 = L_4 + w^4/120$, $\lambda_6 = L_6 - w^6/252$,
$\lambda_8 = L_8 + w^8/240$, and so forth. For further details, see the papers by Craig,
Langdon and Ore, and Wold.

† The reader who wishes to delve further into the mathematical background of
Sheppard’s corrections is urged to read Kendall’s paper.
width w (located at random on the x axis, if Sheppard’s assumptions are
not fulfilled). Furthermore, let

    $\mu'_{1:x}$ = the mean of x in the population
    $\sigma_x^2 = \mu_{2:x}$ = the variance of x in the population          (12)
    $\mu_{k:x}$ = the kth moment of x about its population mean

and let

$$w = a \cdot \sigma_x \qquad (13)$$

Table 4.1 presents a comparison of the distribution of $\bar{x}_R$ in regard to
mean, variance, skewness, and kurtosis with the distribution of x and
with a normal distribution of mean $\mu'_1$ and variance $\sigma^2$.

From a study of Table 4.1 it is apparent that

(1) $\bar{x}$ and $\bar{x}_R$ are each unbiased estimators of the population mean of x.

(2) The variance of $\bar{x}_R$ is $(1 + a^2/12)$ times the variance of $\bar{x}$.

(3) The skewness ($\gamma_1 = +\sqrt{\beta_1}$ of Pearson’s notation) of the distribution
of $\bar{x}_R$ never exceeds that of the distribution of x.

(4) The kurtosis ($\gamma_2 = \beta_2 - 3$ of Pearson’s notation) of the distribution
of $\bar{x}_R$ is always less than that of the distribution of x.

(5) When N is large, $\bar{x}$ and $\bar{x}_R$ are approximately normally distributed
about the population mean of x, $\mu'_{1:x}$.

The case in which the ungrouped variable x is normally distributed
is of special interest. In this case $\bar{x}$ is likewise normally distributed,
whereas the distribution of $\bar{x}_R$ is symmetric but has slightly less kurtosis
than a normal distribution, the deficiency being $(a^4/120N)/(1 + a^2/12)^2$.
From the Gram-Charlier series for the “generalized normal curve,”*
it is seen that this deficiency in kurtosis implies that the distribution
of $\bar{x}_R$ has slightly less area in the “central region,” say, between
$\pm 0.75\sigma_{\bar{x}_R}$, slightly less also in the “tails,” say, beyond $\pm 2.3\sigma_{\bar{x}_R}$, and
slightly more between these points. For any practical rounding, say,
$a = w/\sigma_x < 1$, this effect will surely be negligible. R. A. Fisher has
shown† that the maximum likelihood estimate $\hat{\mu}$ of $\mu'_{1:x}$ from N independent
rounded values $x_R$ of a normally distributed variable x has a
variance of $(1 + a^2/12 - a^6/2{,}880 + \cdots)\,\sigma_x^2/N$. The loss of precision
when $\bar{x}_R$ is used instead of $\hat{\mu}$ is, therefore, only $a^6/2{,}880$, which is clearly
negligible for any practical rounding.
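
To give a feeling for the magnitudes involved, the three quantities mentioned in this paragraph may be tabulated for a few rounding widths. The sketch simply evaluates the expressions quoted in the text for a normally distributed x; the function name, the choice N = 25, and the values of a are illustrative assumptions.

```python
def rounding_effects_on_mean(a, n):
    """Quantities quoted in the text for a normal x with w = a * sigma_x.

    Returns (variance inflation factor of the mean of rounded values,
             deficiency in kurtosis of that mean,
             loss of precision relative to the maximum likelihood estimate).
    """
    variance_factor = 1 + a**2 / 12
    kurtosis_deficiency = (a**4 / (120 * n)) / variance_factor**2
    ml_loss = a**6 / 2880
    return variance_factor, kurtosis_deficiency, ml_loss

for a in (0.25, 0.5, 1.0):
    factor, deficiency, loss = rounding_effects_on_mean(a, n=25)
    print(f"a = {a:4.2f}: variance x {factor:.5f}, "
          f"kurtosis deficiency {deficiency:.2e}, ML loss {loss:.2e}")
```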

In conclusion

a. $\bar{x}_R$ is an unbiased estimator of the population mean of x, $\mu'_{1:x}$,
whatever the value of N.

b. When N is large and the values of $\sigma_x^2$ and w are known, so that
$\sigma^2_{\bar{x}_R} = (\sigma_x^2 + w^2/12)/N$ can be evaluated, confidence limits for, and

* See, for instance, Rietz, Handbook, p. 114.

† Fisher, “Mathematical Foundations,” p. 362.

statistical tests of hypotheses regarding the value of $\mu'_{1:x}$ carried out
with $(\bar{x}_R - \mu'_{1:x})/\sigma_{\bar{x}_R}$ regarded as a normal deviate will be approximately
correct, provided the true distribution of x has “high order contact”
(that is, Sheppard’s assumptions are valid), or the rounding lattice is
located at random with respect to the admissible values of x, or both.
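
Conclusion (b) translates directly into a computation of confidence limits. The following is a minimal sketch under the conditions stated there (N large, $\sigma_x$ and w known); the readings, the value $\sigma_x = 12$, and the function name are hypothetical, and `statistics.NormalDist` is used only to supply the normal deviate.

```python
import math
from statistics import NormalDist, fmean

def mean_confidence_limits(rounded_values, sigma_x, w, confidence=0.95):
    """Large-sample limits for the population mean from rounded data,
    using sigma^2 of the rounded mean = (sigma_x^2 + w^2/12) / N."""
    n = len(rounded_values)
    center = fmean(rounded_values)
    std_error = math.sqrt((sigma_x**2 + w**2 / 12) / n)
    z = NormalDist().inv_cdf(0.5 + confidence / 2)
    return center - z * std_error, center + z * std_error

# Hypothetical readings recorded to the nearest 10 units (w = 10).
readings = [100, 110, 90, 100, 120, 100, 90, 110, 100, 80] * 4
low, high = mean_confidence_limits(readings, sigma_x=12.0, w=10.0)
print(f"95 per cent limits: {low:.2f} to {high:.2f}")
```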

TABLE 4.1

Comparison of the Mean, Variance, Skewness, and Kurtosis of the
Distribution of the Average of N Independent Rounded Values of a Random
Variable with the Corresponding Characteristics of the Distribution of the
Unrounded Variable and of a Normal (Gaussian) Distribution When the
Width of the Rounding Interval Is a Standard Deviation Units ($w = a \cdot \sigma_x$)

Characteristic           Unrounded variable, x                        Average of N rounded values, $\bar{x}_R$          Normal

Mean                     $\mu'_{1:x}$                                 $\mu'_{1:x}$                                      $\mu'_1$

Variance                 $\sigma_x^2$                                 $(1 + a^2/12)\,\sigma_x^2/N$                      $\sigma^2$

Skewness ($\gamma_1$)    $\gamma_{1:x} = \mu_{3:x}/\sigma_x^3$        $\gamma_{1:x}/[N^{1/2}(1 + a^2/12)^{3/2}]$        0

Kurtosis ($\gamma_2$)    $\gamma_{2:x} = \mu_{4:x}/\sigma_x^4 - 3$    $(\gamma_{2:x} - a^4/120)/[N(1 + a^2/12)^2]$      0

4.4. Comparison of the Distribution of $s^2$ for Rounded and Unrounded
Observations, When the Number of Observations Is Large

Let

$$s^2_{x_R} = \frac{\sum_{i=1}^{N} (x_{Ri} - \bar{x}_R)^2}{N - 1} \qquad (14)$$

where $x_{Ri}$ ($i = 1, 2, \ldots, N$) and $\bar{x}_R$ are as in (11). In Table 4.2 the
mean of $s^2_{x_R}$ and the ratio of the variance of $s^2_{x_R}$ to the square of its mean
are compared with the mean and corresponding ratio for $s^2_x$ under two
sets of conditions: first, when x has any distribution for which the
first four moments exist; second, when x is normally distributed.
It is evident from a study of Table 4.2 that the ratio

    (variance of $s^2$)/(mean of $s^2$)$^2$

is always less in the case of $s^2_{x_R}$ than in the case of $s^2_x$. The difference
is slight, however, and for any reasonable rounding, say, $a = w/\sigma_x < 1$,
will generally be negligible. Therefore

(1) When N is large, $(N - 1)s^2_{x_R}/(\sigma_x^2 + w^2/12)$ and $(N - 1)s^2_x/\sigma_x^2$ will
be distributed as $\chi^2$ with N - 1 degrees of freedom to about the same
order of approximation.

TABLE 4.2

The Mean of $s^2_{x_R}$ and of the Ratio of the Variance of $s^2_{x_R}$ to the Square of the
Mean Compared with the Corresponding Quantities for $s^2_x$ When the
Distribution of x Is Arbitrary* and When the Distribution of x Is Normal
($w = a \cdot \sigma_x$)

Distribution of x       Arbitrary                                                                  Normal

Statistic               $s^2_x$                    $s^2_{x_R}$                                     $s^2_x$        $s^2_{x_R}$

Mean                    $\sigma_x^2$               $(1 + a^2/12)\,\sigma_x^2$                      $\sigma_x^2$   $(1 + a^2/12)\,\sigma_x^2$

Variance/(Mean)$^2$     $2/(N-1) + \gamma_{2:x}/N$   $2/(N-1) + (\gamma_{2:x} - a^4/120)/[N(1 + a^2/12)^2]$   $2/(N-1)$   $2/(N-1) - a^4/[120N(1 + a^2/12)^2]$

* Subject to the restriction that the first four moments of x about its population mean exist.



(2) The quantity $s^2_{x_R} - w^2/12$ is an unbiased estimator of the variance
of x in the population, $\sigma_x^2$, whatever the value of N, provided the distribution
of x has high order contact (that is, Sheppard’s three assumptions,
stated in Sec. 4.2, are fulfilled), or the rounding lattice is located at
random with respect to the admissible values of x, or both.*

It may be well to point out here that the ratio

    $(N - 1)(s^2_{x_R} - w^2/12)/\sigma_x^2$

should not be used as a substitute for the ratio given in (1) unless $a = w/\sigma_x$
is quite small, since the variance of this ratio exceeds the variance of the
ratio given in (1) by a factor of $(1 + a^2/12)^2$, which will generally render

* When N is small, the mean and variance/(mean)$^2$ relations shown in Table 4.2
are still valid, but the discontinuities of the distribution of $s^2_{x_R}$ are such that graduation
of the distribution of $s^2_{x_R}$ by a continuous distribution is virtually impossible (see
Sec. 5.1).

Kendall has remarked that (2) gives the correct estimator of $\sigma_x^2$ if the rounding of
the values of x is determined before the sample is drawn. If, on the other hand, the
observations $x_1, x_2, \ldots, x_N$ are drawn first and then rounded, he suggests that
$s^2_{x_R} - w^2/[12(N - 1)]$ be used to estimate $\sigma_x^2$. When N is large the difference is
clearly negligible from a practical standpoint but is of some theoretical interest.
the $\chi^2$ approximation less accurate. If the adjusted numerator and the
denominator $\sigma_x^2 + w^2/12$ are used, then the variance of the resulting ratio
will be the same as the variance of the ratio given in (1), but the mean
will be deficient by a factor of $1/(1 + a^2/12)$. If $s^2_{x_R}$ is used in the
numerator and $\sigma_x^2$ in the denominator, the mean will be excessive by a
factor of $(1 + a^2/12)$ and the variance by a factor of $(1 + a^2/12)^2$.
Consequently,

(3) Determinations of confidence limits for, and statistical tests of
hypotheses regarding, the value of $\sigma_x^2$ should be based on $s^2_{x_R}$ as implied
by (1) and will be valid to the extent that the conditions and approximations
thereof obtain.

In conclusion, it should be remarked that the foregoing discussion can
be carried over to the case where $s^2_{x_R}$ has been obtained from an analysis
of variance of values of x that have been rounded to within $\pm w/2$ and
has n degrees of freedom. The only change required is replacement of
N - 1 by n throughout.
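
The distinction drawn in paragraphs (1) to (3), namely the adjusted quantity for estimation and the unadjusted $s^2_{x_R}$ with the enlarged denominator for tests, can be put in code as follows. This is a sketch only; the data, the hypothesized variance of 50, and the function names are hypothetical.

```python
from statistics import variance

def variance_estimates(rounded_values, w):
    """Return (s^2 of the rounded values, Sheppard-adjusted estimate).

    The first is equation (14), with divisor N - 1; the second is the
    unbiased estimator of paragraph (2), s^2 - w^2/12.
    """
    s2_rounded = variance(rounded_values)
    return s2_rounded, s2_rounded - w**2 / 12

def chi_square_ratio(rounded_values, sigma2_x, w):
    """(N - 1) s^2_{x_R} / (sigma_x^2 + w^2/12), the ratio of paragraph (1),
    to be referred to chi-square with N - 1 degrees of freedom."""
    n = len(rounded_values)
    return (n - 1) * variance(rounded_values) / (sigma2_x + w**2 / 12)

data = [10.0, 20.0, 20.0, 30.0]     # hypothetical readings, nearest 10 units
s2, s2_adjusted = variance_estimates(data, w=10.0)
ratio = chi_square_ratio(data, sigma2_x=50.0, w=10.0)
print(s2, s2_adjusted, ratio)
```

A sample of four is of course far from large and is used here only so that the arithmetic can be followed by hand.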


4.5. Effect of Rounding on the Distribution of Student’s t

Let

$$t_x = \frac{\sqrt{N}\,(\bar{x} - \mu'_{1:x})}{s_x} \qquad (15)$$

and

$$t_{x_R} = \frac{\sqrt{N}\,(\bar{x}_R - \mu'_{1:x})}{s_{x_R}} \qquad (16)$$

where $\bar{x}$ and $\bar{x}_R$ are as in (1) and (11), $s_x$ and $s_{x_R}$ are as in (2) and (14),
$\mu'_{1:x}$ is the mean value of x in the population, and N is the number of
independent values of x (or of $x_R$) in the sample.

When the N independent observations involved are from a normal
distribution, the distribution of $t_x$ is known as “Student’s t distribution,” in
recognition of the fact that the distribution of $z = t/\sqrt{N - 1}$ under
these conditions was first given by Student in the epochal paper from
which the quotation in Sec. 1 of this chapter is taken. In his derivation,
Student utilized the independence of $\bar{x}$ and s when computed from
independent observations from a normal population, although he himself
merely established the fact that the product moment coefficients of
correlation between $\bar{x}$ and s, and between $\bar{x}^2$ and $s^2$, are zero under these
conditions. Independence of $\bar{x}$ and s is actually a necessary condition.
A rigorous proof of their independence in random samples from a normal
population was communicated to Student by R. A. Fisher in correspondence
prior to the First World War; this first saw light, only incidentally,
as a preliminary step in Fisher’s paper, published in 1915, on the exact
sampling distribution of the correlation coefficient.* The first complete

* Fisher, “Distribution of the Correlation Coefficient.”

derivation of Student’s distribution was given, in terms of t, by Fisher
in 1926,* although he had given a more general definition of t two years
earlier.†

As Student pointed out in the passage quoted in Sec. 1 of this chapter,
there is a correlation between the sample mean and the sample standard
deviation in random samples of rounded observations, resulting from the
rounding, so that $\bar{x}_R$ and $s_{x_R}$ will not be independently distributed even
when $\bar{x}$ and $s_x$ are independent. Nevertheless, Student’s experimental
results suggest that the distribution of $t_{x_R}$ will closely approximate that
of $t_x$ when N is large enough to render negligible the discontinuities of
the distribution of $t_{x_R}$. That this is so is evident from a study of the
first four moments of $t_{x_R}$ for the case of x having an arbitrary distribution
subject only to the condition that its first four moments exist. The
first four moments of $t_{x_R}$ can be evaluated readily, under this assumption
regarding x, from the expressions for the first four moments of t in terms
of the semi-invariants of x, given by Geary.‡ The first four moments of
$t_{x_R}$ are given in Table 4.3, where they are compared with the first four
moments of Student’s t distribution, that is, the moments of $t_x$ when x
is normally distributed.§

It is evident from a study of Table 4.3 that, to terms of order 1/N,
rounding affects only the fourth moment, and that the effect is clearly
negligible in the case of samples from a normal population ($\gamma_{1:x} = 0$ and
$\gamma_{2:x} = 0$) for any reasonable rounding, say, $a = w/\sigma_x < 1$. It can be
seen from a study of Geary’s more extensive formulas that, when the
distribution of x is symmetrical but not necessarily normal, the first
and third moments of $t_{x_R}$ are zero through terms in $1/N^2$ at least and the
variance of $t_{x_R}$ has its normal value through the terms in 1/N. Indeed,
when x is normally distributed, the variance of $t_{x_R}$ has its normal value
through terms in $1/N^2$ also. Therefore, it may be stated that

When N is sufficiently large and w is sufficiently small to render
the discontinuities of the distribution of $t_{x_R}$ negligible, the distributions of
$t_{x_R}$ and $t_x$ will follow the standard t distribution to about the same order
of approximation, provided the distribution of x has high order contact,
or the rounding lattice is located at random with respect to the admissible
values of x, or both.

* Fisher, “Applications of Student’s Distribution.”

† Fisher, “Error Function of Well-Known Statistics.”

‡ Geary, “Distribution of Student’s Ratio.”

§ The moments of $t_{x_R}$ are given to terms of order $1/N$; that is, terms
in the 3/2 and higher powers of $1/N$ have been neglected. Geary’s general
expressions for the first four moments are correct to terms of order
$1/N^2$. The terms of order $1/N^{3/2}$ and $1/N^2$ for $t_{x_R}$ can be
evaluated with the aid of the semi-invariants of $x_R$ given in the
footnote to expressions (10) above.



TABLE 4.3

First Four Moments of the Distribution of Student’s t When t Is Based on a
Sample of N Independent Rounded Observations from Any Population with
Finite First Four Moments, Compared with the Corresponding Moments of the
Standard t Distribution




It should be noted that $s^2_{x_R}$, and not its adjusted value given in
Paragraph (2) of Sec. 4.4, is to be used in the denominator of $t_{x_R}$. If
the adjusted value is used, giving

\[ t'_{x_R} = \frac{\sqrt{N}\,(\bar{x}_R - \mu'_{1:x})}{\sqrt{s^2_{x_R} - \dfrac{w^2}{12}}} \tag{17} \]

then, since $|t'_{x_R}| > |t_{x_R}|$, it is clear that significant values of
$t'_{x_R}$ will arise too frequently. Furthermore, when $s^2_{x_R}$ is less
than $w^2/12$, the value of $t'_{x_R}$ will be imaginary, and in such cases
some convention regarding the value of $t'_{x_R}$ to be recorded will be
needed. In other words, as Fisher has stated, “These [Sheppard’s]
adjustments should be used for purposes of estimation, but not usually for
tests of significance.”*

The foregoing discussion carries over to the case where $s_{x_R}$ has been
obtained by an analysis of variance, or otherwise, from values of $x$
rounded to within $\pm w/2$ and has $n$ degrees of freedom, provided the
situation is such that $\bar{x}$ and $s_x$ may be regarded as independently
distributed. The only change required is to replace $N - 1$ by $n$ throughout.
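The warning about the adjusted denominator in (17) can be made concrete with a small numerical sketch. The function name `t_statistics` and the sample values are hypothetical, chosen only for illustration; the hypothesized mean `mu` and the rounding width `w` are supplied by the caller.

```python
import math

def t_statistics(rounded_sample, mu, w):
    """Return (t, t_adjusted) for a sample of rounded observations.

    t uses the unadjusted sample variance. t_adjusted uses the variance
    reduced by w**2 / 12 (Sheppard's adjustment) and is None when the
    adjusted variance is not positive, i.e. when (17) would be imaginary
    or undefined. The sample is assumed to have a nonzero variance.
    """
    n = len(rounded_sample)
    mean = sum(rounded_sample) / n
    s2 = sum((x - mean) ** 2 for x in rounded_sample) / (n - 1)
    numerator = math.sqrt(n) * (mean - mu)
    t = numerator / math.sqrt(s2)
    s2_adjusted = s2 - w ** 2 / 12.0
    t_adjusted = numerator / math.sqrt(s2_adjusted) if s2_adjusted > 0 else None
    return t, t_adjusted

# Observations rounded to the nearest unit (w = 1), tested against mu = 1.5.
t, t_adj = t_statistics([1.0, 2.0, 2.0, 3.0], mu=1.5, w=1.0)
print(t, t_adj)  # the adjusted statistic is the larger in absolute value

# Twelve equal values and one neighbor: s2 = 1/13 is below w**2/12.
print(t_statistics([2.0] * 12 + [3.0], mu=2.0, w=1.0))
```

The second call illustrates the case in which the adjusted variance is negative, so that no real value of the adjusted statistic exists.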


5. EFFECTS OF ROUNDING ON STATISTICAL ANALYSES OF
SMALL NUMBERS OF OBSERVATIONS FROM A NORMAL POPULATION

5.1. Introduction 

Although the results obtained above are independent of the value of
$N$, the discussion up to this point has been concerned mainly with
cases involving a large number of observations. An exception is the
quotation from Student in Sec. 1, which discusses in detail the effects of a
coarse rounding on samples of 4 from a nearly normal population. From
this passage it is clear that in the case of small samples the discontinuities
of the distributions of $\bar{x}_R$ and $s^2_{x_R}$ are the most serious
effect of the rounding. For a fixed $N$, the discontinuities of the
distributions of $\bar{x}_R$ and $s^2_{x_R}$ may be expected to increase in
number (in any finite interval) but to decrease in seriousness as the width
of the rounding interval $w$ decreases, and, for a fixed $w$, to increase in
number but decrease in seriousness as $N$ increases; hence, it is clear that
largeness of $N$ has meaning only in conjunction with smallness of $w$.


* Fisher, Statistical Methods, Sec. D of Appendix to Chap. III.


5.2. Exact Distributions of $s^2_{x_R}$ in Samples of 2 and 3 from a Normal
Population

5.2.1. Illustrative Distributions. In order to throw light on the
joint effect of $w$ and $N$ on the distribution of $s^2_{x_R}$ in small
samples from a normal population, exact distributions of $s^2_{x_R}$ in
samples of 2 from a normal population have been computed for $w = \sigma$,
$\sigma/2$, and $\sigma/4$. These are portrayed in Figs. 4.1, 4.2, and 4.3,
where they are compared with the distribution of $s_x^2$, which is
distributed as $\chi^2\sigma^2$ for 1 degree of freedom. In Fig. 4.1 the
distribution of

\[ s'^2_x = \left(1 + \frac{a^2}{12}\right) s_x^2 \tag{18} \]

is shown also, in view of (1) in Sec. 4.4.

Fig. 4.1. Distribution of $s^2_{x_R}$ for samples of 2 from a normal
population when the width of the rounding interval is one standard
deviation, and corresponding continuous distributions of $s_x^2$ and
$s'^2_x = (13/12)s_x^2$.

Fig. 4.2. Distribution of $s^2_{x_R}$ for samples of 2 from a normal
population when the width of the rounding interval is one-half standard
deviation, and corresponding continuous distribution of $s_x^2$.

The large steps in the distributions of $s^2_{x_R}$ shown in these figures
emphasize the discontinuous character of these distributions. It
follows from (1) of Sec. 4.4 that when $N$ is large the distribution of
$s'^2_x$ may be expected to graduate the distribution of $s^2_{x_R}$ more
closely than will the distribution of $s_x^2$. From Fig. 4.1 it is clear
that the improvement is negligible when $N = 2$ and $w = \sigma$.
Furthermore, it is clearly impossible to find smooth curves that will
approximate these step functions closely at all points. In other words, the
probability of $s'^2_x$, or $s_x^2$, equaling or exceeding a value $u$ will
approximate closely the probability of $s^2_{x_R}$ equaling or exceeding $u$
only for certain values of $u$, principally those that
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sampling is from a normal population. These two positions have been 
chosen as follows: 

Position I: The mean of $x$ in the population coincides with the
boundary between two of the rounding intervals. In the notation
of the footnote, this means that if the mean of $x$ in the population
is chosen as the origin, so that $\mu'_{1:x} = 0$, then
$\theta = -(2i - 1)\pi$, where $i$ is some positive or negative integer,
or zero.

TABLE 4.4

Exact Distributions of $s^2_{x_R} = (x_{R_1} - x_{R_2})^2/2$ for the Extreme
Positions of the Rounding Lattice When the Width of the Rounding Interval Is
One Standard Deviation and $x_1$ and $x_2$ Are Independent Observations
from a Normal Population
($N = 2$, $w = \sigma_x$)

                 Probability that $s^2_{x_R}/\sigma_x^2 \ge u$
   u*            Position I       Position II

   0             1.0000 000       1.0000 000
   1/2           0.7291 077       0.7290 858
   2             0.2987 666       0.2987 802
   9/2           0.0832 659       0.0832 609
   8             0.0152 891       0.0152 903
   25/2          0.0018 126       0.0018 124
   18            0.0001 368       0.0001 368
   49/2          0.0000 065       0.0000 065
   32            0.0000 002       0.0000 002
   Over 32       0.0000 000       0.0000 000

   $E(s^2_{x_R})/\sigma_x^2$     1.0833 332       1.0833 334
   $V(s^2_{x_R})/\sigma_x^4$     2.3430 509       2.3430 491

* The values of u shown include every possible value up to 32.


Position II: The mean of $x$ in the population coincides with
the mid-point of a rounding interval. Thus $\theta = -2\pi i$, where $i$ is
some positive or negative integer, or zero.


* i = ( i + i) w 

$\theta$ may be used to define the location of the rounding lattice. Thus
$\theta = 0$ when the mid-point of the 0th interval is at the origin
($x = 0$), and $\theta = \pi$ when the lower boundary of the 0th interval is
at the origin. Clearly, the rounding is the same for $\theta = \phi$ and
$\theta = \phi \pm 2k\pi$, where $k$ is an integer.
Table 4.7 gives to 6 decimal places the exact distributions of $s^2_{x_R}$
for $N = 3$ and $w = \sigma$, corresponding to these two positions of the
rounding lattice, when the sampling is from a normal distribution.

TABLE 4.5

Exact Distributions of $s^2_{x_R} = (x_{R_1} - x_{R_2})^2/2$ for the Extreme
Positions of the Rounding Lattice When the Width of the Rounding Interval Is
One-half Standard Deviation and $x_1$ and $x_2$ Are Independent
Observations from a Normal Population
($N = 2$, $w = \sigma_x/2$)

                 Probability that $s^2_{x_R}/\sigma_x^2 \ge u$
   u*            Position I       Position II

   0             1.0000 000       1.0000 000
   1/8           0.8604 037       0.8604 037
   1/2           0.5977 898       0.5977 898
   9/8           0.3792 359       0.3792 360
   2             0.2183 109       0.2183 110
   25/8          0.1134 753       0.1134 754
   9/2           0.0530 515       0.0530 515
   49/8          0.0222 398       0.0222 398
   8             0.0083 396       0.0083 396
   81/8          0.0027 919       0.0027 919
   25/2          0.0008 331       0.0008 331
   121/8         0.0002 213       0.0002 213
   18            0.0000 523       0.0000 523
   169/8         0.0000 110       0.0000 110
   49/2          0.0000 021       0.0000 021
   225/8         0.0000 004       0.0000 004
   32            0.0000 001       0.0000 001
   Over 32       0.0000 000       0.0000 000

   $E(s^2_{x_R})/\sigma_x^2$     1.0208 312       1.0208 314
   $V(s^2_{x_R})/\sigma_x^4$     2.0839 751       2.0839 755

* The values of u shown include every possible value up to 32.


The probabilities given in these tables were evaluated with the aid of a
15-place table of the probabilities for normal deviates,* and are believed
to be correct to within 1 unit in the last place shown. The calculations
were carried to 10 decimals or more at all stages and were carefully
checked. The individual probabilities obtained, when rounded to the
number of places shown, sum to unity within ±1 in the last place shown.

From these tables, and the fact that Positions I and II represent
opposite extremes in their effects on $s^2_{x_R}$, the following conclusions
may be drawn:

(1) When $N = 2$, the probability that $s^2_{x_R}/\sigma^2$ equals or
exceeds $u$ is unaffected to 4 decimal places by the position of the
rounding lattice if $w = \sigma$; to 7 decimal places, in general, if
$w \le \sigma/2$.

(2) When $N = 3$, the probability that $s^2_{x_R}/\sigma^2$ equals or
exceeds $u$ is unaffected by the position of the rounding lattice to
3 decimals if $w = \sigma$.

(3) The effect of the position of the rounding lattice on the distribution
of $s^2_{x_R}$ appears to increase with the number of observations involved,
that is, with $N$.

* Mathematical Tables Project, Tables of Probability Functions.
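The tail probabilities tabulated for samples of 2 can be recomputed directly, since $s^2_{x_R}$ then depends only on how many rounding intervals separate the two observations. The following is a minimal sketch of that calculation, not the procedure used for the tables: it relies on double-precision `math.erfc` instead of a 15-place table, takes $\sigma = 1$, and the function names and the truncation parameter `span` are assumptions made for illustration.

```python
import math

def normal_cdf(z):
    """Standard normal distribution function."""
    return 0.5 * math.erfc(-z / math.sqrt(2.0))

def cell_probabilities(a, position, span=12.0):
    """Probabilities of the rounding intervals for a unit normal variable.

    a is the interval width in sigma units. Position "I" puts the population
    mean on an interval boundary, Position "II" on an interval mid-point.
    Intervals lying more than about `span` sigma from the mean are ignored.
    """
    offset = 0.0 if position == "I" else -a / 2.0
    m = int(math.ceil(span / a))
    return [normal_cdf(offset + (i + 1) * a) - normal_cdf(offset + i * a)
            for i in range(-m, m)]

def tail_probabilities_n2(a, position):
    """Map each attainable u = k**2 * a**2 / 2 to P(s2_R / sigma**2 >= u), N = 2."""
    p = cell_probabilities(a, position)
    n = len(p)
    # P(|i - j| = k) for two independent observations in cells i and j.
    separation = [0.0] * n
    for i in range(n):
        for j in range(n):
            separation[abs(i - j)] += p[i] * p[j]
    tails, running = {}, 0.0
    for k in range(n - 1, -1, -1):  # accumulate from the far tail inward
        running += separation[k]
        tails[k * k * a * a / 2.0] = running
    return tails

for position in ("I", "II"):
    tails = tail_probabilities_n2(1.0, position)
    print(position, round(tails[0.5], 7), round(tails[2.0], 7))
```

With `a = 1.0` the printed values can be compared with the rows $u = 1/2$ and $u = 2$ of Table 4.4; calling the function with `a = 0.5` or `a = 0.25` gives the corresponding entries for the finer roundings.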


TABLE 4.6

Exact Distributions of $s^2_{x_R} = (x_{R_1} - x_{R_2})^2/2$ for the Extreme
Positions of the Rounding Lattice When the Width of the Rounding Interval Is
One-quarter Standard Deviation and $x_1$ and $x_2$ Are Independent
Observations from a Normal Population
($N = 2$, $w = \sigma_x/4$)

                 Probability that $s^2_{x_R}/\sigma_x^2 \ge u$
   u*            Position I       Position II

   0             1.0000 000       1.0000 000
   1/32          0.9296 594       0.9296 595
   1/8           0.7911 480       0.7911 481
   9/32          0.6589 474       0.6589 474
   1/2           0.5366 322       0.5366 322
   25/32         0.4269 273       0.4269 273
   9/8           0.3315 445       0.3315 445
   49/32         0.2511 525       0.2511 525
   2             0.1854 693       0.1854 693
   81/32         0.1334 464       0.1334 464
   25/8          0.0935 041       0.0935 041
   121/32        0.0637 758       0.0637 758
   9/2           0.0423 269       0.0423 269
   169/32        0.0273 253       0.0273 253
   49/8          0.0171 541       0.0171 541
   225/32        0.0104 691       0.0104 691
   8             0.0062 099       0.0062 099
   289/32        0.0035 793       0.0035 793
   81/8          0.0020 043       0.0020 043
   361/32        0.0010 902       0.0010 902
   25/2          0.0005 759       0.0005 759
   441/32        0.0002 954       0.0002 954
   121/8         0.0001 471       0.0001 471
   529/32        0.0000 711       0.0000 711
   18            0.0000 333       0.0000 333
   625/32        0.0000 151       0.0000 151
   169/8         0.0000 066       0.0000 066
   729/32        0.0000 028       0.0000 028
   49/2          0.0000 011       0.0000 011
   841/32        0.0000 004       0.0000 004
   225/8         0.0000 001       0.0000 001
   Over 225/8    0.0000 000       0.0000 000

   $E(s^2_{x_R})/\sigma_x^2$     1.0052 058       1.0052 058
   $V(s^2_{x_R})/\sigma_x^4$     2.0207 869       2.0207 869

* The values of u shown include every possible value up to 225/8.


The mean and variance of $s^2_{x_R}$ for each of the situations considered
in Tables 4.4 to 4.7 are given in these tables. These values are brought
together in Table 4.8, where they are compared with the corresponding
average values computed from the formulas of Table 4.2. When $s^2_{x_R}$
is based on $N$ independent (rounded) observations from a normal
distribution, then the formula of Table 4.2 for the average variance of
$s^2_{x_R}$ becomes

average variance of (l + (19) 

where $a = w/\sigma$. Equation (19) gives the average value of the variance
of $s^2_{x_R}$ in a series of repeated trials in each of which $N$ independent

TABLE 4.7

Exact Distributions of $s^2_{x_R} = \sum_{i=1}^{3} (x_{R_i} - \bar{x}_R)^2/2$
for the Extreme Positions of the Rounding Lattice When the Width of the
Rounding Interval Is One Standard Deviation and $x_1$, $x_2$, and $x_3$ Are
Independent Observations from a Normal Population
($N = 3$, $w = \sigma_x$)

                 Probability that $s^2_{x_R}/\sigma_x^2 \ge u$
   u*            Position I      Position II

   0             1.000 000       1.000 000
   1/3           0.915 415       0.915 156
   1             0.541 194       0.541 514
   4/3           0.339 213       0.338 898
   7/3           0.190 327       0.190 245
   3             0.071 985       0.072 096
   4             0.040 097       0.040 103
   13/3          0.027 447       0.027 411
   16/3          0.008 814       0.008 808
   19/3          0.005 122       0.005 122
   7             0.002 200       0.002 206
   25/3          0.000 629       0.000 629
   9             0.000 401       0.000 401
   28/3          0.000 278       0.000 278
   31/3          0.000 098       0.000 098
   12            0.000 027       0.000 027
   37/3          0.000 019       0.000 019
   13            0.000 008       0.000 008
   43/3          0.000 002       0.000 002
   16            0.000 001       0.000 001
   Over 16       0.000 000       0.000 000

   $E(s^2_{x_R})/\sigma_x^2$     1.083 330       1.083 336
   $V(s^2_{x_R})/\sigma_x^4$     1.170 804       1.170 831

* The values of u shown include every possible value up to 16.


observations are drawn at random from a normal distribution, the position
of the rounding lattice being determined at random independently
for each trial. Since the average value of the mean of $s^2_{x_R}$ is
unaffected by the nature of the distribution of $x$, its value is given by
the formula of Table 4.2 in any case. From Table 4.8 it is seen that

* When $N$ is large so that $(N - 1)/N \approx 1$, formula (19) agrees through
the term in $a^4$ with the expression given by R. A. Fisher, “Mathematical
Foundations,” p. 362, for the variance of
$\sum_{i=1}^{N} (x_{R_i} - \mu'_{1:x})^2/N$ computed from $N$ independent
rounded observations from a normal population.
(1) When $N$ is 2 or 3 and $w \le \sigma$, the mean values of $s^2_{x_R}$ for
Position I and Position II of the lattice agree with each other and with
their theoretical average values, to 5 decimal places or more.

(2) When $N$ is 2 or 3 and $w = \sigma$, the variances of $s^2_{x_R}$ for
Position I and Position II of the lattice agree with each other and with
their theoretical average values to 2 decimal places or more; when
$w \le \sigma/2$, to 3 decimal places or more.


TABLE 4.8

Comparison of Actual Values of the Mean and Variance of $s^2_{x_R}$ with the
Values Implied by the Formulas of Table 4.2 When the Width of the Rounding
Interval Is One Standard Deviation (and Less) and $s^2_{x_R}$ Is Based on
Two or Three Independent Observations from a Normal Distribution

 Number of      Width of       Lattice      Mean value of $s^2_{x_R}/\sigma_x^2$    Variance of $s^2_{x_R}/\sigma_x^2$
 observations,  rounding       position*
 N              interval,                   Actual        By formula      Actual        By formula
                σ units

 2              1              I            1.0833 322    1.0833 333      2.3430 509    2.3423 322
 2              1              II           1.0833 334    1.0833 335      2.3430 491    2.3423 322
 2              1/2            I            1.0208 312    1.0208 335      2.0839 751    2.0839 300
 2              1/2            II           1.0208 314    1.0208 335      2.0839 755    2.0839 300
 2              1/4            I            1.0052 058    1.0052 085      2.0207 869    2.0208 711
 2              1/4            II           1.0052 058    1.0052 085      2.0207 869    2.0208 711
 3              1              I            1.083 330     1.083 335       1.170 804     1.170 351
 3              1              II           1.083 336     1.083 335       1.170 831     1.170 351

* For the exact meanings of Position I and Position II, see Section 5.2.2.


Fisher* has studied the effect of the position of the rounding interval
on the mean values of $\bar{x}_R$ and $s^2_{x_R}$ in samples from a normal
distribution using an analytical approach based on the periodic character
of such effects noted in the first footnote of this section. He has
calculated that the bias of $\bar{x}_R$ due to the position of a rounding
lattice with intervals one standard deviation wide will be less than
one-tenth the standard error of $\bar{x}_R$ for values of $N$ less than
$1.379 \times 10^{16}$. For $s^2_{x_R}$ the corresponding value of $N$ is
$1.75 \times 10^{14}$; for the variance of $s^2_{x_R}$, it is
$1.34 \times 10^{12}$.

From (1) and (2) above and from Fisher’s analysis, we conclude that

(3) The effect of the position of the rounding lattice on the mean and
variance of $\bar{x}_R$ and $s^2_{x_R}$ in samples from a normal distribution
is minute in comparison to the average effects of the rounding.

(4) In small samples (from a normal distribution at least), so far as
the approximation of the distributions of $\bar{x}_R$ and $s^2_{x_R}$ by
continuous distributions is concerned, the discontinuities of the
distributions of $\bar{x}_R$ and $s^2_{x_R}$, caused by the rounding, are far
more serious than the effects of the position of the rounding lattice.

* Fisher, “Mathematical Foundations,” pp. 318-321.
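How little the lattice position matters for the mean and variance of $s^2_{x_R}$ can be checked numerically for samples of 2. The sketch below is an illustration under stated assumptions ($\sigma = 1$, double-precision arithmetic, a hypothetical function name `moments_s2_n2`, and intervals beyond about 12σ ignored); it compares the two extreme positions with the Sheppard value $1 + a^2/12$ for the mean.

```python
import math

def normal_cdf(z):
    """Standard normal distribution function."""
    return 0.5 * math.erfc(-z / math.sqrt(2.0))

def moments_s2_n2(a, boundary_at_mean, span=12.0):
    """Mean and variance of s2_R / sigma**2 for N = 2 rounded normal observations.

    a is the rounding width in sigma units. boundary_at_mean=True corresponds
    to Position I (mean on an interval boundary); False to Position II
    (mean at an interval mid-point).
    """
    offset = 0.0 if boundary_at_mean else -a / 2.0
    m = int(math.ceil(span / a))
    p = [normal_cdf(offset + (i + 1) * a) - normal_cdf(offset + i * a)
         for i in range(-m, m)]
    mean = 0.0
    second_moment = 0.0
    for i, p_i in enumerate(p):
        for j, p_j in enumerate(p):
            s2 = ((i - j) * a) ** 2 / 2.0  # s2_R when the cells differ by i - j
            mean += p_i * p_j * s2
            second_moment += p_i * p_j * s2 * s2
    return mean, second_moment - mean ** 2

for a in (1.0, 0.5, 0.25):
    sheppard_mean = 1.0 + a * a / 12.0
    print(a, moments_s2_n2(a, True), moments_s2_n2(a, False), sheppard_mean)
```

The two positions give means that differ from each other, and from $1 + a^2/12$, only in the later decimal places, in line with conclusions (1) to (3).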

5.3. Combinations of Sample Sizes and Rounding-interval Widths for
Which the Actual Distribution of $s^2_{x_R}$ Can Be Approximated
Satisfactorily by a Continuous Distribution, When the Sampling Is From
a Normal Population

We have not found a rigorous answer to the question of how large
the sample size $N$ needs to be for a given rounding-interval width $w$ and
how small $w$ needs to be for a given value of $N$ before the $\chi^2$
approximation given in Paragraph (1) of Sec. 4.4 will adequately represent
the distribution of $s^2_{x_R}$ for most practical purposes. Since the
discontinuities of the distribution of $s^2_{x_R}$ are the principal source
of the difficulties encountered in attempting to represent the actual
distribution of $s^2_{x_R}$ by a continuous distribution when $N$ is small
and $w$ is comparatively large, it is clear that an answer depends on
finding the combinations of $N$ and $w$ for which the discontinuities of the
distribution of $s^2_{x_R}$ over any given interval are sufficiently numerous
and the steps sufficiently small to render them negligible.

TABLE 4.9

Probability That a Zero Value of $s^2_{x_R}$ Will Be Obtained When
$s^2_{x_R}$ Is Based on N Independent Rounded Observations from a Normal
Distribution

 Number of          Width of rounding interval in σ units (a = w/σ)
 observations, N    1          1/2         1/4         1/8

 2                  0.271      0.1396      0.0703      0.0352
 3                  0.085      0.0225      0.0057      0.0014
 4                  0.028      0.0038      0.0005−     0.0001−
 5                  0.009      0.0007
 6                  0.003      0.0001+
 7                  0.001+

The combinations of $N$ and $w$ for which the discontinuities of the
distribution of $s^2_{x_R}$ are negligible when the sampling is from a normal
population could be determined by preparing, for wide ranges of values
of $N$ and $w$, tables of the actual distribution of $s^2_{x_R}$ analogous to
Tables 4.4 to 4.7. The arithmetical labor of the calculations involved in
the preparation of such tables is so enormous, however, that it seems
advisable to devise some rule of thumb, which is intuitively sound, as a
substitute for rigorous answers until the latter become available.
Accordingly, we propose that the probability of $s^2_{x_R}$ equaling zero for
given values of $N$ and $w$, when the sampling is from a normal population,
be used as a basis for deciding whether a given pair of values of $N$ and
$w$ is satisfactory. To
be more explicit, we propose that $P\{s^2_{x_R} = 0 \mid N, w\} < 0.001$,
evaluated on the assumption that the sampling is from a normal population,
be adopted as a tentative definition of the combinations of values of $N$
and $w$ for which the $\chi^2$ approximation given in (1) of Sec. 4.4
provides a satisfactory approximation to the distribution of $s^2_{x_R}$ for
practical purposes.
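The proposed criterion is easy to evaluate, because $s^2_{x_R} = 0$ exactly when all $N$ observations fall in the same rounding interval. The sketch below assumes a unit normal population and a lattice with the population mean at an interval mid-point (Position II); the function name `prob_zero_variance` and the truncation parameter `span` are hypothetical.

```python
import math

def normal_cdf(z):
    """Standard normal distribution function."""
    return 0.5 * math.erfc(-z / math.sqrt(2.0))

def prob_zero_variance(n_obs, a, span=12.0):
    """P(s2_R = 0): all n_obs observations fall in one rounding interval.

    a = w / sigma is the rounding width in sigma units. The lattice is placed
    with the population mean at an interval mid-point (an assumption), and
    intervals centered more than about `span` sigma away are ignored.
    """
    m = int(math.ceil(span / a))
    total = 0.0
    for i in range(-m, m + 1):
        lower = (i - 0.5) * a
        total += (normal_cdf(lower + a) - normal_cdf(lower)) ** n_obs
    return total

def satisfies_rule_of_thumb(n_obs, a, threshold=0.001):
    """True when the probability of a zero sample variance is below the threshold."""
    return prob_zero_variance(n_obs, a) < threshold

for n_obs in (2, 3, 4):
    print(n_obs, [round(prob_zero_variance(n_obs, a), 4) for a in (1.0, 0.5, 0.25)])
```

For example, `satisfies_rule_of_thumb(4, 0.25)` holds while `satisfies_rule_of_thumb(3, 0.25)` does not.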

Values of $P\{s^2_{x_R} = 0 \mid N, w\}$, for sampling from a normal
population, are given in Table 4.9 for various combinations of values of
$N$ and $w$, the latter being expressed in units of the population standard
deviation. It appears from this table that satisfactory combinations, where
$a = w/\sigma$, are


    $a \le 1/300$,    $N \ge 2$    (by extrapolation)
    $a \le 1/10$,     $N \ge 3$
    $a \le 1/4$,      $N \ge 4$
    $a \le 1/2$,      $N \ge 5$                                      (20)
    $a \le 3/4$,      $N \ge 6$
    $a \le 1$,        $N \ge 7$


The fact that the combinations (20) represent a somewhat arbitrary
definition of the circumstances under which (1) and (3) of Sec. 4.4 may be
expected to yield satisfactory results in practical work should not be lost
sight of. Nor should the combinations (20) be regarded as competing
with, or contradicting, the general recommendation, cited in the first
paragraph of Sec. 4.1, that $w$ be taken less than $\sigma/3$, or better,
less than $\sigma/4$. The object of this general recommendation is to limit
the inevitable loss of precision of estimates of parameters caused by
rounding, and relates to the case where the choice of a rounding interval
is yet to be made. Combinations (20), on the other hand, are not concerned
with the loss of precision of estimates as a consequence of coarse rounding
but are proposed as a rough indication of the circumstances under which
data can be validly analyzed by simple modifications of standard
procedures.

5.4. Distribution of Student’s t and the Variance Ratio When Based on 
a Small Number of Rounded Observations 

Like the distributions of all functions of rounded observations, the
distribution of $t_{x_R}$, defined in equation (16) of Sec. 4.5, is
discontinuous. However, since the discontinuities of the distribution of
$\bar{x}_R$ and the distribution of $s_{x_R}$ jointly affect the distribution
of $t_{x_R}$, its discontinuities are more numerous in any given interval
than those of the distributions of $\bar{x}_R$ and $s_{x_R}$, and, generally,
the steps are smaller than those of the distribution of $s_{x_R}$. On the
other hand, whenever $s_{x_R}$ equals zero, this causes $t_{x_R}$ to be either
$+\infty$ or $-\infty$, according to the sign of the numerator, with the
exception of those cases in which $\bar{x}_R$ equals the population mean,
when $t_{x_R}$ can be defined to be equal to +1 or −1, the sign being
determined in a given instance by a random mechanism giving each sign
equal probability.* The last-mentioned situation cannot occur unless
$\mu'_{1:x}$ is a multiple of $1/N$ and will occur very rarely even when it
is a possibility. On the other hand, it can be seen from Table 4.9 that
infinite values of $t_{x_R}$ may occur with annoying frequency in very small
samples, for example $N < 4$, when coarse rounding is involved. If, however,
the rounding employed with a given $N$ is in accord with the
recommendations (20), then the probability of an infinite value of $t_{x_R}$
will be less than 0.001. Furthermore, the rounding so chosen will
simultaneously render the steps in the distribution of $t_{x_R}$ negligible,
so that for practical purposes $t_{x_R}$ may be regarded as following the
standard t distribution, which is strictly valid only for $t_x$.

Similar considerations apply to the distribution of the ratio $F_R$ of two
mean squares, $s^2_{1x_R}$ and $s^2_{2x_R}$, which are such that in the
absence of the rounding $n_1 s_1^2$ and $n_2 s_2^2$ would be distributed
independently as $\chi^2\sigma^2$ for $n_1$ and $n_2$ degrees of freedom,
respectively. In particular, $F_R = s^2_{1x_R}/s^2_{2x_R}$ will equal 0
whenever $s^2_{1x_R}$ is 0; it will equal infinity whenever $s^2_{2x_R}$ is
0; it may be defined to be 1 when both $s^2_{1x_R}$ and $s^2_{2x_R}$ are 0.
Four exact distributions of $F_R$ were calculated from the distributions of
$s^2_{x_R}$ given in Tables 4.4 to 4.7. The cases considered were
$n_1 = n_2 = 1$, $w = \sigma$, $\sigma/2$, $\sigma/4$; $n_1 = n_2 = 2$,
$w = \sigma$. As expected, the discontinuities of $F_R$ were more numerous
in any given interval than the discontinuities of the distribution of
$s^2_{x_R}$ alone, and the steps were smaller. The biggest steps were those
corresponding to $F_R = 0$, $F_R = 1$, and $F_R = +\infty$. From the graphs
that were sketched it was apparent that if these three steps were reduced
to negligible size, either by increasing $n_1$ and $n_2$ or by decreasing
$w$, then the distribution of $F_R$ would be represented by the F
distribution to an approximation sufficient for most practical purposes.
It is believed that if the $N_1$ $(= n_1 + 1)$, $N_2$ $(= n_2 + 1)$, and the
common rounding interval satisfy the recommendations (20) above, then $F_R$
may be regarded as distributed according to the F distribution for most
practical purposes.
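The conventions just described for degenerate values of $F_R$ can be stated compactly in code. This is only an illustrative sketch; the function name `variance_ratio` is hypothetical.

```python
import math

def variance_ratio(s2_numerator, s2_denominator):
    """Ratio of two mean squares from rounded data, with the stated conventions.

    Both zero -> 1; denominator alone zero -> +infinity; numerator alone
    zero -> 0 (which ordinary division already gives).
    """
    if s2_numerator == 0 and s2_denominator == 0:
        return 1.0
    if s2_denominator == 0:
        return math.inf
    return s2_numerator / s2_denominator

print(variance_ratio(0.0, 0.0), variance_ratio(0.5, 0.0), variance_ratio(0.0, 2.0))
```

These three special values are the ones that produce the largest steps in the distribution of $F_R$.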

If $s^2_{1x_R}$ and $s^2_{2x_R}$ correspond to different roundings $w_1$ and
$w_2$, then the distribution of $F_R$ will be biased relative to the
distribution of $F$. This bias cannot be eliminated when $\sigma^2$ is
unknown, the situation in which F tests are customarily employed.

6. STATISTICAL PROPERTIES OF TABULAR AND INTERPOLATED VALUES 

It is often stated, or implied, in statistical texts that the error of a
tabular entry may be regarded as a random variable distributed with a
uniform probability density between minus ½ unit and plus ½ unit in the
last place. Clearly the rounding error of a given tabular entry has a
definite value, although it may be unknown to a user of the table. Hence
the error of a given tabular entry is not a random variable in any
meaningful sense of the expression, its distribution being concentrated at
a point. Therefore, the statement appearing in statistical texts must be
intended to mean that when one is carrying out computations involving the
use of a large number of different tabular numbers, the errors of the
individual tabular numbers considered collectively will, under favorable
circumstances, range from minus ½ unit to plus ½ unit in the last place
with approximately uniform density.

* The complication of this definition arises because $F = 0/0 = 1$ seems a
sensible definition, and $t = 0/0 = 0$ also seems sensible, but $t^2 = F$.
The complication avoids the implication that $0^2 = 1$.

The circumstances may sometimes be quite unfavorable. Thus in a 
given computation involving the use of numbers from only a small 
section of a table—and in certain other less easily specified cases also— 
the errors may all tend to have the same sign or to be bunched in some 
other manner characteristic of the values of the function when expressed 
to the given number of decimal places for that particular range of the 
argument. Indeed, the favorability of the circumstances may depend 
on the units in which the function is tabulated, as the following parallel 
portions of two logarithm tables illustrate:

    N        log10 N         log_e N

    9861     3.9939 210      9.1963 429
    9862     3.9939 650      9.1964 443
    9863     3.9940 090      9.1965 457
    9864     3.9940 531      9.1966 470
    9865     3.9940 971      9.1967 484
    9866     3.9941 411      9.1968 498
    9867     3.9941 851      9.1969 511
    9868     3.9942 291      9.1970 525
    9869     3.9942 731      9.1971 538
    9870     3.9943 172      9.1972 551
    9871     3.9943 612      9.1973 564
    9872     3.9944 051      9.1974 577
    9873     3.9944 491      9.1975 590
    9874     3.9944 931      9.1976 603
    9875     3.9945 371      9.1977 616
    9876     3.9945 811      9.1978 629
    9877     3.9946 251      9.1979 641
    9878     3.9946 690      9.1980 653
    9879     3.9947 130      9.1981 666


These logarithms are given above correct to 7 decimal places. From an 
examination of the digits in the seventh place in the case of the values of 
$\log_{10} N$, it is apparent that if one is calculating with the aid of
logarithms read from a 6-place table of $\log_{10} N$, and the values of $N$
involved are limited to the range shown above, the errors of the logarithms
will be bunched around a value of one-tenth unit in the sixth place and
quite definitely will not be distributed more or less uniformly from minus
½ unit to plus ½ unit in the sixth place. On the other hand, if the
computations are carried out with the aid of a 6-place table of
$\log_e N$, then even though the values of $N$ involved are confined to the
range shown above, the errors of the logarithms may be fairly uniformly
distributed between minus ½ unit and plus ½ unit in the sixth place,
although there will be a gap in the distribution of the error in the
vicinity of two-tenths unit in the last place.
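The contrast between the two logarithm tables can be reproduced with a few lines of code. In this sketch the double-precision values returned by `math.log10` and `math.log` stand in for the exact logarithms, which is an assumption adequate at 6 places; the function name `rounding_errors` is hypothetical.

```python
import math

def rounding_errors(func, arguments, places=6):
    """Errors (rounded value minus computed value) in units of the last place kept."""
    scale = 10 ** places
    return [round(func(n) * scale) - func(n) * scale for n in arguments]

arguments = range(9861, 9880)
errors_log10 = rounding_errors(math.log10, arguments)
errors_ln = rounding_errors(math.log, arguments)

# The common-logarithm errors fall in a narrow band; the natural-logarithm
# errors spread over much more of the interval from -1/2 to +1/2.
print(min(errors_log10), max(errors_log10))
print(min(errors_ln), max(errors_ln))
```

The same function can be pointed at any other tabulated function or range of the argument to look for bunching of this kind before the uniform-error assumption is relied upon.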

Since there is a definite relationship between the exact values of a
function for successive values of the argument, there is likewise a definite
relationship between the errors due to rounding in the tabular values
given correct to $n$ decimal places. Hence the errors due to rounding to
$n$ places are never actually equivalent to independent observed values of a
random variable uniformly distributed between minus ½ and plus ½
unit in the last place retained. Nevertheless, if a large number of entries,
corresponding to widely dispersed values of the argument, are examined,
the figures beyond the $n$th place will often form decimal fractions that
appear to be more or less uniformly and independently distributed
between 0 and 1 in units of the last figure retained; in some cases, this is
true even for relatively small values of $n$. Furthermore, when the
function tabulated is irrational or transcendental, the larger the value
of $n$, the more nearly do the decimals beyond the $n$th place form decimal
fractions that seem to be uniformly and independently distributed, for a
given spacing of the argument. There are exceptions, of course. For
example, the functions $10^m/9$, $10^m/99$, $10^m/999$, etc. for
$m = 0, \pm 1, \pm 2, \ldots$, which have only 0’s and 1’s in their decimal
expansions, do not exhibit either of the aforementioned properties.
Similarly, tables of good functions may have some bad patches. However, the
fact that the above properties are quite common, especially in the case of
decimal expansions of irrational and transcendental numbers, suggests that
in many cases it may be useful to regard the errors in tabular numbers due
to rounding as uniformly and independently distributed between plus ½
and minus ½ unit in the last place tabulated, provided one exercises
reasonable judgment in applying the deductions from this assumption
in individual cases.

Let us assume then, with the foregoing reservations, that when the
error of a tabular number due to rounding lies between $\pm w/2$, the error
$\epsilon$ is distributed with uniform probability density between
$\pm w/2$. It follows immediately that

\[ \text{Mean error} = \frac{1}{w} \int_{-w/2}^{w/2} \epsilon \, d\epsilon = 0 \tag{21} \]

\[ \sigma_\epsilon^2 = \text{variance of errors} = \frac{1}{w} \int_{-w/2}^{w/2} \epsilon^2 \, d\epsilon = \frac{w^2}{12} = 0.0833w^2 \tag{22} \]

whence the standard deviation of the errors is

\[ \sigma_\epsilon = \frac{w}{\sqrt{12}} = 0.2887w \tag{23} \]


Consequently, if rounding errors are viewed from this statistical
viewpoint, equation (21) implies that rounding is unbiased, that is, the
average difference between the rounded tabular entry and the true value of
the function will be zero. Maintaining the statistical viewpoint, it follows
that the sum of the errors of $k$ entries that are “correct” to within
$\pm w/2$, assuming independence of the errors, will be distributed about
a mean of zero with a standard deviation of $w\sqrt{k/12}$; furthermore,
the distribution is approximately normal (Gaussian) for $k > 4$.*

As an example of the use of this result, consider a tabulation,
supposedly correct to 5 decimals, of the nine terms in the expansion of
$(p + q)^8$ for $p = 1/6$. The sum of the nine values is 1.00003. Does
this discrepancy of 0.00003 from the true sum suggest that a mistake
has been made in the calculations? The maximum admissible discrepancy
is bounded by ±0.000045, so there is no positive proof that the
calculations are in error. The observed discrepancy, measured in units
of 0.00001, corresponds approximately to a normal deviate of

\[ \frac{3 - 0}{\sqrt{9/12}} = 3.46. \]

Since the probability of a normal deviate differing from 0 by more than
2.576 is less than 0.01,† it is advisable to check the calculations
(probably starting with the sum).
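The arithmetic of this check can be sketched as follows, under the same assumption of independent, uniformly distributed rounding errors. The function name `sum_discrepancy_check` is hypothetical, and the two-sided tail probability is taken from the normal approximation via `math.erfc`.

```python
import math

def sum_discrepancy_check(discrepancy_units, k):
    """Normal deviate and two-sided tail probability for a sum of k rounded entries.

    discrepancy_units is the observed minus the true sum, in units of the last
    place retained (w = 1), so each error has variance 1/12 and the sum of k
    errors has standard deviation sqrt(k / 12).
    """
    standard_deviation = math.sqrt(k / 12.0)
    deviate = discrepancy_units / standard_deviation
    two_sided_probability = math.erfc(abs(deviate) / math.sqrt(2.0))
    return deviate, two_sided_probability

deviate, probability = sum_discrepancy_check(3, 9)
max_possible = 9 * 0.5  # each entry can err by at most half a unit
print(round(deviate, 2), probability, abs(3) <= max_possible)
```

The deviate agrees with the 3.46 obtained above; the discrepancy is within the largest possible value of 4.5 units but is improbable under the uniform-error assumption.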

* For details, see the papers by Baten, Hall, Irwin, and Rietz listed in Sec. 7.

† This may be verified from Table 1.1 of Chap. 1, since the probability is
2(0.005) that a normal deviate will be either above $K_{0.005}$ or below
$-K_{0.005}$.

‡ Fisher and Wishart, “On the Distribution of the Error of an Interpolated
Value.”

Fisher and Wishart‡ have used the statistical viewpoint to determine
the distribution of the error of an interpolated value when the values
of the argument are equally spaced. Let the spacing of the argument be
taken as unity, and let 0 to 1 represent the interval containing the value
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of the argument p for which it is desired to obtain an interpolated value; 
then, if the interpolation is performed without recourse to differences, 
using Lagrangian interpolation coefficients, * the error of the interpolate 
can be expressed simply in terms of the tabular entries used to obtain it. 
It is, in fact, the “interpolate” of the errors: 

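$$
\begin{aligned}
\text{Error of linear (2-point) interpolate} &= (1 - p)\,\epsilon_0 + p\,\epsilon_1 \\
\text{Error of quadratic (3-point) interpolate} &= \frac{p(p - 1)\,\epsilon_{-1} + 2(1 - p^2)\,\epsilon_0 + p(p + 1)\,\epsilon_1}{2} \\
\text{Error of } (2m)\text{-point interpolate} &= \sum_{k=-m+1}^{m} A_k^{2m}(p)\,\epsilon_k \\
\text{Error of } (2m + 1)\text{-point interpolate} &= \sum_{k=-m}^{m} A_k^{2m+1}(p)\,\epsilon_k
\end{aligned}
\tag{24}
$$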

If the errors of the tabular entries may be regarded as independent
random variables with a common variance $\sigma^2$, then the variance of an
r-point interpolate for argument p is simply

$$V(p,r) = \sum_k \left[A_k^{r}(p)\right]^2 \sigma^2 \tag{25}$$

where the summation is over k from $-m + 1$ to m when r = 2m, and
over k from $-m$ to $+m$ when r = 2m + 1. In particular,†

$$V(p,2) = \text{variance of 2-point Lagrangian interpolate} = [(1 - p)^2 + p^2]\,\sigma^2 = [1 - 2p(1 - p)]\,\sigma^2, \qquad 0 \le p \le 1 \tag{26}$$

and

$$V(p,3) = \text{variance of 3-point Lagrangian interpolate} = \frac{[p^2(p - 1)^2 + 4(1 - p^2)^2 + p^2(p + 1)^2]\,\sigma^2}{4} = \frac{[6p^2(p^2 - 1) + 4]\,\sigma^2}{4}, \qquad -1 \le p \le 1 \tag{27}$$

It can be determined readily by differentiation that the minimum
value of $V(p,3)$ corresponds to $p = \pm 1/\sqrt{2}$ and equals

$$\tfrac{5}{8}\sigma^2 = 0.62500\,\sigma^2$$

* For numerical values of these coefficients for 3-point up to 11-point interpolation,
see Mathematical Tables Project, Tables of Lagrangian Interpolation Coefficients;
formulas for these coefficients are given in the introduction to the tables.

† Expression (26) is identical to that given by Fisher and Wishart, p. 914. In
place of (27) they give an equivalent expression in terms of a variable p that varies
from 0 to 1 when the p of (27) varies from $-\tfrac{1}{2}$ to $+\tfrac{1}{2}$.

which exceeds the minimum value of $V(p,2)$, namely $\sigma^2/2 = 0.50000\,\sigma^2$,
which corresponds to $p = \tfrac{1}{2}$. Also, for $0 \le p \le 1$, it is clear that the
maxima are $V(0,2) = V(0,3) = V(1,2) = V(1,3) = \sigma^2$. Table 4.10
shows the minima and the average values of $V(p,r)$, $0 \le p \le 1$, in the
case of (2m)-point interpolation (m = 1, 2, 3, 4), and the practical
minima* and the average values for $-\tfrac{1}{2} \le p \le \tfrac{1}{2}$ in the case of
(2m + 1)-point interpolation (m = 0, 1, 2, 3).

TABLE 4.10

Minimum and Average Values of the Variance of an r-point Interpolate for
an Argument between 0 and 1 When r Is Even, and between $-\tfrac{1}{2}$ and $+\tfrac{1}{2}$ When
r Is Odd, the Values of the Argument Being Spaced at Unit Intervals and the
Errors of the Tabular Entries Being Independently Distributed with a
Common Variance $\sigma^2$

Interpolate            Minimum Variance*        Average Variance†
                       (units of $\sigma^2$)    (units of $\sigma^2$)

1-point (an entry)     1.00000                  1.00000
2-point (linear)       0.50000                  0.66666
3-point                0.71875                  0.89375
4-point                0.64062                  0.77566
5-point                0.74060                  0.89574
6-point                0.70599                  0.82244
7-point                0.76579                  0.90291
8-point                0.74533                  0.84942
infinite-point         1.00000                  1.00000

* In the case of (2m)-point interpolation, the values shown are actual minima, occurring at $p = \tfrac{1}{2}$;
and in the case of (2m + 1)-point interpolation, they are practical minima, occurring at $p = \pm\tfrac{1}{2}$.
For explanation, see text.

† From Fisher and Wishart, pp. 914-915. The average has been taken over the interval $0 \le p \le 1$
in the case of (2m)-point interpolates; over the interval $-\tfrac{1}{2} \le p \le +\tfrac{1}{2}$ in the case of (2m + 1)-point
interpolates.
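
The first few rows of Table 4.10 can be checked numerically. The following Python sketch is an illustration only: the function names are invented here, the Lagrangian coefficients are computed directly from their product formula rather than taken from the published tables, and the averages are obtained by a simple midpoint rule, which is an assumption of this sketch and not the method of Fisher and Wishart.

```python
from fractions import Fraction

def lagrange_coefficients(p, r):
    """Lagrangian coefficients for r tabular points at unit spacing.
    Points run from -m+1 to m when r = 2m, and from -m to m when r = 2m+1."""
    m = r // 2
    nodes = range(-m + 1, m + 1) if r % 2 == 0 else range(-m, m + 1)
    coeffs = []
    for k in nodes:
        c = 1
        for j in nodes:
            if j != k:
                c = c * (p - j) / (k - j)   # product form of the k-th coefficient
        coeffs.append(c)
    return coeffs

def variance_factor(p, r):
    """V(p, r) in units of the common variance: sum of squared coefficients."""
    return sum(c * c for c in lagrange_coefficients(p, r))

def average_variance_factor(r, steps=2000):
    """Midpoint-rule average of V(p, r) over 0..1 (r even) or -1/2..1/2 (r odd)."""
    lo = 0.0 if r % 2 == 0 else -0.5
    h = 1.0 / steps
    return sum(variance_factor(lo + (i + 0.5) * h, r) for i in range(steps)) * h

half = Fraction(1, 2)
for r in (2, 3, 4):
    print(r, float(variance_factor(half, r)), round(average_variance_factor(r), 5))
```

Passing `Fraction(1, 2)` keeps the minimum-variance column exact (for example 23/32 for the 3-point interpolate), while the averages are floating-point approximations.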


In summary, if the errors of the tabular entries may be regarded as
independent random variables with a common variance $\sigma^2$, if the values
of the argument are uniformly spaced at unit intervals, and if r-point
interpolation for the interpolate corresponding to argument p, with
$0 \le p \le 1$ for r even and $-\tfrac{1}{2} \le p \le +\tfrac{1}{2}$ for r odd, is carried on
without differences, then

(1) The variance of the random error of the interpolate is always 
less than the variance of a single tabular entry. 

* In the case of (2m + 1)-point interpolation for $\tfrac{1}{2} < p < 1$, a more accurate
interpolate can be obtained by taking p = 1 as origin, dropping the tabular entry
for $k = -m$, adding the entry for $k = m + 1$, and using $1 - p$ as argument. With
this convention, $V(p, 2m + 1)$ will have a practical minimum at $p = \pm\tfrac{1}{2}$, although
actual minima occur at $p = \pm p'$ with $|p'| > \tfrac{1}{2}$.

(2) The variance of the random error of the interpolate is least for
linear (2-point) interpolation.

(3) The variance of the random error of a (2m + 2)-point interpolate
is greater than the variance of the random error of a (2m)-point
interpolate, but less than the variance of the random error of a
(2m + 1)-point interpolate.

(4) The variance of the random error of a (2m + 3)-point interpolate
is greater than the variance of the random error of a (2m + 1)-point
interpolate.

From (3) and (4) it is clear that, if (2m)-point interpolation does not
yield the accuracy desired, but either (2m + 1)-point or (2m + 2)-point
will,* then it may be advisable to employ (2m + 2)-point rather
than (2m + 1)-point interpolation in order to minimize the loss of
precision.

It should be noted that these results require merely that the errors
of the successive tabular entries be regarded as independently distributed
with a common variance; it is not assumed that the errors are due solely
to rounding and are distributed with uniform probability density between
$-w/2$ and $+w/2$. The results are, of course, applicable to this case,
with $\sigma^2 = w^2/12$, as given by (22), when the independence assumption is
tenable. Fisher and Wishart point out, however, that in this latter
case the distribution of the error of the interpolate is more or less
intractable, and does not tend to the normal (Gaussian) distribution ($\beta_2$ tends
to 2.2 and not to 3) as r increases. This somewhat limits the usefulness
of the standard deviation as a measure of the random error of the
interpolate, although the inequalities of Tchebycheff and of Camp and Meidell
are still applicable.† Since, by (24), the error of an interpolate is a
linear function of the errors of the tabular entries when the interpolation 
is carried out without recourse to differences (that is, by using Lagrangian 
interpolation coefficients), the error of the interpolate will be normally 
distributed when the individual errors are independently and normally 
distributed with a common variance. Accordingly, they note that a 
table printed to one or two places more than are correct is capable of 
yielding a more accurate and more precise interpolate than the same 
table with the entries cut down, provided that the standard deviation 
of an entry is given and the errors are approximately normally and 
independently distributed about a mean of zero with a common variance. 

* The reader is cautioned against inferring that (r + 1)-point, (r + 2)-point,
and so forth, interpolation always yields greater accuracy than r-point interpolation;
he is referred to Steffensen, Interpolation, pp. 34-53, for a discussion of the number of
points to be used in a given instance and for a simple example in which the accuracy
becomes worse as r increases.

† See, for instance, Shewhart, Economic Control of Quality, pp. 176-177.

CHAPTER 5


ACCEPTANCE INSPECTION WHEN LOT QUALITY IS MEASURED 

BY THE RANGE 

1. TYPE OF PROBLEM 

The range, defined as the difference between the largest and smallest
observations in a sample, is familiar in industrial experimentation and
quality control as a measure of dispersion in samples. Many applications
of the range are to normal or nearly normal populations, for which
the variability is better characterized by the standard deviation. The
range is used to measure dispersion in samples because of the ease with
which it can be determined, and because in small samples it is almost
as efficient a measure of variability as the standard deviation.* The
present chapter is concerned with an interesting reversal of these roles, 
where the variability of the lot or process is characterized in terms of the 
range itself but where the standard deviation is used as the sampling 
statistic in acceptance inspection to determine whether the range meets 
specified requirements. 

One type of problem in which the range is the parameter of primary 
interest is that of ensuring the virtual simultaneity of a set of events. 
This set of events may be thought of as a random sample of k units from
an underlying population of such events, and the requirement of simultaneity
as a demand that the time difference between the occurrence of the
first and last event of such a sample be small. In practice, of course,
some samples will meet this requirement and others will not, but we can 
formulate the requirement statistically as follows: Let the size of the 

* One statistic is more efficient than another if it varies less from sample to sample 
for a fixed population, that is, if its sampling variability is less. To get a basis of 
comparison, it is necessary to convert values of one statistic into implied values of 
the other. For example, given the range of a sample of N from a normal population, 
an estimate of the standard deviation can be made with the aid of Tippett's table of
the mean range, in units of the population standard deviation, for samples of N from
a normal population. Such an estimate will vary more from sample to sample than
if the standard deviation had been computed directly by the usual formula. Conversely,
given the sample standard deviation, an estimate of the sample range can be
formed; such an estimate will vary less from sample to sample than if the range had
been computed directly. For samples of 2, however, the range and standard deviation 
are equally variable; for samples of 3 the range is slightly more variable than the 
standard deviation, and as the sample size increases the variability of the range 
increases relative to that of the standard deviation. (For full citations, see Sec. 5.) 

Chap. 3, Sec. 4.4, pp. 160-163, discusses the efficiency of statistics. 


sample be k, and let $t_i$ (i = 1, 2, ..., k) be the time of occurrence
of the ith event of the sample measured from an arbitrary time origin.
Denote by $R_k$ the difference between the longest and the shortest of these
times in a particular sample. Then the requirement for an acceptable
lot is that in not more than a fraction y of samples of k shall the range
$R_k$ exceed a fixed value U. If the time can be measured for each item
in the lot, it may be possible to form sets of k in such a way that the range
is satisfactory in all or nearly all sets. On the other hand, if determination
of the time alters or destroys the unit, if the time varies from
one trial to another for a given unit, or if measurement of the times and
systematic grouping is expensive, there are advantages to a procedure
such as that suggested, based on random grouping.

A typical problem of ensuring temporal simultaneity arises in connection
with mechanisms for which there is a certain lapse of time
between activation (for example, closing a switch) and operation. If
such mechanisms are used in assemblies of two or more activated from a
single source, it may be important to the functioning of the assembly that
these time lapses be nearly equal, in other words, that all the operations
occur within a brief interval. This need can be fulfilled by requiring
that each lot of the component mechanisms be such that the range of
delay times in random sets of k mechanisms be less than some predetermined
constant. It is not possible, without testing the entire lot, to
assure that every set of k formed at random from a lot will have an 
acceptable range; but it is possible by sampling inspection to obtain any 
required level of assurance, short of certainty, of detecting lots worse 
than any specified quality P, as measured by the percentage of sets of 
k in which the range is too large. 

Problems of controlling the difference between the largest and smallest
measurements in a group of k items are not confined to situations involving
temporal variability. Balance may require that a set of k objects
not show too great a range in weight; flatness or straightness may 
require that a set of k items not vary too much in thickness or length; 
appearance may depend upon the range of variation within sets of k rather 
than upon some other type of variability. 

2. METHOD OF SOLUTION 

It is clear that when acceptance inspection is based on samples, 
there is always a risk that when good lots are offered they will be rejected, 
or that when bad lots are offered they will be accepted. The former 
risk depends on the frequency with which good lots yield bad samples, and 
the latter on the frequency with which bad lots yield good samples.
These frequencies can be made as low as we please by taking sufficiently
large samples and choosing the acceptance criterion properly, but they
cannot be eliminated except by complete inspection of the lot. Following
established usage, we shall call the first kind of risk the “producer's
risk” and the second kind the “consumer’s risk.”

For concreteness, let us assume that a lot of acceptable quality
is defined as one in which the probability is at least $p_1 = 0.99$ that the
range of 14 items selected at random will be less than 0.3, measured in
whatever units are appropriate. Let us suppose further that the producer’s
risk is to be a small quantity, say $\alpha = 0.01$; that is to say, if lots
of quality $p_1$ are offered we wish to reject them, on the average, no
oftener than 1 time in 100. Suppose that a lot is unacceptable if the
probability is $p_2 = 0.85$, or less, that the range of 14 items selected at
random will be less than 0.3; and that the consumer’s risk (the probability
of accepting a lot of quality $p_2$ or worse if offered) is to be at most
$\beta = 0.01$. Finally, let us assume that the distribution of the quality
characteristic is adequately represented by a normal distribution.*

For a normal distribution the probability that the range of k independent
observations is less than some preassigned number U depends
only on the standard deviation $\sigma$. Thus, the criteria of acceptable and
unacceptable lots, which were expressed above in terms of the proportion
of the ranges of sets of 14 that are less than 0.3, can be expressed in
terms of the lot standard deviation $\sigma$. It follows that the acceptance-inspection
plan can be formulated in terms of the sample standard
deviation, or, as proves to be more convenient in practice, in terms of the
sum and the sum of squares of the observations. This reformulation of
the definition of an acceptable lot permits a great gain in the efficiency
of the inspection plan; for in using the sum of squared deviations as a 
measure of acceptability we make full use of the information provided 
by inspection of the sample, rather than of just that portion of the 
information given by selecting the extreme individuals in each group of 
14 observations. 

Let $\sigma_1$ be the value of the lot standard deviation that defines good
quality. Then $\sigma_1$ depends on the group size k, on the acceptable value
U of the range in groups of k, and on the proportion of ranges that
can be allowed to exceed U. These three quantities are related to $\sigma_1$
through the cumulative distribution of the standardized range, which
gives the probability that the range $R_k$ of k items from a normal
distribution will exceed any specified multiple $W_k$ of the population standard
deviation $\sigma$. Tables of the cumulative distribution of the range have
been published by Pearson.† In Fig. 5.1, which is based on Pearson’s
tables, the vertical axis shows the probability that $R_k/\sigma$ will exceed the
values of W shown on the horizontal axis, and the various curves
correspond with various group sizes k. For our purposes, $R_k$ is given, W
is obtained from Pearson’s table or from Fig. 5.1, and $\sigma$ is the quotient

$$\sigma = \frac{R_k}{W} \tag{1}$$

* If the assumption of normality is not justified, the inspection can be carried out
on an attributes basis, by classifying each group of 14 in the sample as satisfactory
or not.

† Pearson's table gives cumulative probabilities to 4 decimal places for values of k
(in his notation n) from 2 to 20 by steps of 1 and for values of W from 0 to 7.25 by
steps of 0.01.

Fig. 5.1.—Cumulative probability distribution of the range in samples of k from a normal
population.

For the illustration cited, we find the value of $W_{14}$ corresponding with
probabilities of 0.99 and 0.85 on the curve for k = 14. These are 5.40
and 4.20, respectively; on substituting them and $R_k = 0.3$ into (1), we find
1/18 and 1/14 as the values of $\sigma_1$ and $\sigma_2$.

Thus the problem becomes that of devising an inspection procedure
for the standard deviation such that the operating-characteristic curve
passes through the two points

$$\sigma_1 = 0.05556, \qquad L_{\sigma_1} = 0.99 \tag{2}$$

$$\sigma_2 = 0.07143, \qquad L_{\sigma_2} = 0.01 \tag{3}$$

Such an inspection procedure may be carried out by either single or 
sequential sampling. The solutions are indicated only briefly here.* 


3. SINGLE SAMPLING 

To determine the required sample size, we compute†

$$\frac{\sigma_2^2}{\sigma_1^2} = 1.6531 \tag{4}$$

Then in a table of $\chi^2$ we find the number of degrees of freedom n for which

$$\frac{\chi^2_{\alpha}(n)}{\chi^2_{1-\beta}(n)} = \frac{\sigma_2^2}{\sigma_1^2} \tag{5}$$

where $\chi^2_{\epsilon}$ denotes the value of $\chi^2$ exceeded with probability $\epsilon$.

In the present example, since $\alpha = \beta = 0.01$, we may refer to Table 8.2
and seek 1.6531 in the column for $\beta = 0.01$; however, the table shows
only that n > 120. We therefore use the large sample approximation‡

$$n = \frac{1}{2}\left(\frac{K_\alpha \sigma_1 + K_\beta \sigma_2}{\sigma_2 - \sigma_1}\right)^2 = 173 \tag{6}$$

where $K_\alpha$ and $K_\beta$ represent normal deviates exceeded with probabilities
$\alpha$ and $\beta$, respectively, which may be found from Table 1.1 in Chap. 1.
Thus 173 is the number of degrees of freedom required, and since the
mean is unknown the required sample size is

$$N = n + 1 = 174 \tag{7}$$


From the sample of 174, we calculate

$$S = \sum_{i=1}^{N} x_i^2 - \frac{\left(\sum_{i=1}^{N} x_i\right)^2}{N} \tag{8}$$

where $x_i$ (i = 1, 2, ..., N) represents the value of the ith measurement
in the sample. The lot is rejected if $S > S_0$, where

$$S_0 = \sigma_1^2\,\chi^2_{\alpha}(n) \tag{9}$$

in which $\chi^2_{\alpha}(n)$ represents the value of $\chi^2$ for n (= N − 1) degrees of
freedom exceeded with probability $\alpha$; otherwise the lot is accepted. In
the present example, n is outside the range of the $\chi^2$ tables, so we use
the approximation (due to R. A. Fisher)

$$\chi^2_{\alpha}(n) = \frac{\left(K_\alpha + \sqrt{2n - 1}\right)^2}{2} = 218.416 \tag{10}$$

* For a discussion of tests of the standard deviation see, on single sampling,
Chap. 8, and, on sequential sampling, Chap. 17 below and Statistical Research Group,
Sequential Analysis, Sec. 6.

† See Chap. 8, Sec. 2.4, especially Example III, pp. 274-278.

‡ See Chap. 8, Sec. 2.4, first footnote on p. 278.
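
The single-sampling arithmetic of this example can be retraced with a short Python sketch. The function and variable names are hypothetical, the normal deviate 2.32635 is the standard tabular value for a tail probability of 0.01, and the formulas coded are assumptions of this sketch: the quotient of range and standardized range for each standard deviation, the large-sample expression for n, Fisher's approximation to the percentage point of chi-square, and a rejection limit taken as the acceptable variance times that percentage point.

```python
import math

K_01 = 2.32635  # normal deviate exceeded with probability 0.01 (standard tables)

def single_sampling_plan(U, W1, W2, K_alpha, K_beta):
    """Return (sigma1, sigma2, n, N, chi2_alpha, S0) for a single-sampling plan.
    U is the limiting range; W1 and W2 are standardized ranges read from a
    table of the distribution of the range (assumed inputs)."""
    sigma1 = U / W1                      # standard deviation defining good quality
    sigma2 = U / W2                      # standard deviation defining bad quality
    # Large-sample approximation to the degrees of freedom, rounded to nearest.
    n = round(0.5 * ((K_alpha * sigma1 + K_beta * sigma2) / (sigma2 - sigma1)) ** 2)
    N = n + 1                            # one extra observation because the mean is unknown
    # Fisher's approximation to the chi-square value exceeded with probability alpha.
    chi2_alpha = 0.5 * (K_alpha + math.sqrt(2 * n - 1)) ** 2
    S0 = sigma1 ** 2 * chi2_alpha        # rejection limit for the sum of squared deviations
    return sigma1, sigma2, n, N, chi2_alpha, S0

sigma1, sigma2, n, N, chi2_alpha, S0 = single_sampling_plan(0.3, 5.40, 4.20, K_01, K_01)
print(round(sigma1, 5), round(sigma2, 5), n, N, round(chi2_alpha, 3))
```

Rounding n to the nearest integer is a choice made here to match the worked figure of 173; a more cautious plan might round upward.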


The OC curve is easily obtained from the fact that the probability
$L_\sigma$ of accepting a lot of quality $\sigma$ when offered is simply the probability
that a value of $\chi^2$ based on n degrees of freedom will be less than $S_0/\sigma^2$.
When $\alpha$ = 0.01 or 0.05 and n does not exceed 120, Table 8.1 or 8.2
provides 13 points on the OC curve; values of $\sigma^2/\sigma_1^2$ are shown for various
values of $L_\sigma$. When n is beyond the range of the table, the OC curve
can be found from


$$\sigma = \frac{\left(K_\alpha + \sqrt{2n}\right)\sigma_1}{\sqrt{2n} - K_{L_\sigma}} \tag{11}$$


4. SEQUENTIAL SAMPLING 

A sequential-inspection plan based on standard deviations is defined 
by two parallel straight lines given by the formulas 


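$$Z_n = h_2 + sn \quad \text{(rejection line)} \tag{12}$$

$$Z_n = -h_1 + sn \quad \text{(acceptance line)} \tag{13}$$

where $h_1$, $h_2$, and s are positive, n represents one less than the number of
observations taken sequentially, and

$$Z_n = \sum_{i=1}^{n+1} x_i^2 - \frac{\left(\sum_{i=1}^{n+1} x_i\right)^2}{n + 1}$$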


is the sum of squares of deviations of the n + 1 observations from their 
mean. 

The sequential procedure is as follows: Draw the two parallel lines 
given by equations (12) and (13) with Z n on the vertical scale and n on 
the horizontal scale. Start taking observations, beginning with two 
observations and thereafter adding one observation at a time. After each 
observation compute Z n from all observations and plot its value against 
the corresponding value of n (which is one less than the total number of 
observations thus far taken). If at any stage the plotted point falls on 
or above the upper line, given by equation (12), reject the lot; if it falls 
on or below the lower line, given by equation (13), accept the lot. If, 
however, the plotted point falls between the two parallel lines, take
another observation. Continue this process until a decision of acceptance
or rejection is reached.

The two intercepts $-h_1$ and $h_2$ and the common slope s, which determine
the two parallel lines, are given by

$$h_1 = \frac{2\log_e\dfrac{1 - \alpha}{\beta}}{\dfrac{1}{\sigma_1^2} - \dfrac{1}{\sigma_2^2}} = 0.071799 \tag{14}$$

$$h_2 = \frac{2\log_e\dfrac{1 - \beta}{\alpha}}{\dfrac{1}{\sigma_1^2} - \dfrac{1}{\sigma_2^2}} = 0.071799 \tag{15}$$

and

$$s = \frac{\log_e\dfrac{\sigma_2^2}{\sigma_1^2}}{\dfrac{1}{\sigma_1^2} - \dfrac{1}{\sigma_2^2}} = 0.0039268 \tag{16}$$

The OC curve for the sequential plan is computed from the following
formulas:

$$\sigma = \sqrt{\frac{1 - e^{-2st}}{2t}}, \qquad L_\sigma = \frac{e^{t(h_1 + h_2)} - e^{th_1}}{e^{t(h_1 + h_2)} - 1} \tag{17}$$

It will be observed that $L_\sigma$, the ordinate of the OC curve, is not given
directly as a function of $\sigma$, but parametrically through a third variable t
in terms of which $\sigma$ is also defined. The OC curve for this example is
shown in Fig. 5.2.

The average sample number (ASN) curve shows the relationship 
between the standard deviation in the submitted lot and the average 
number of items that will be inspected before a decision is reached. The 
ordinate $\bar{N}_\sigma$ of the ASN curve, corresponding to the abscissa $\sigma$, is given by

$$\bar{N}_\sigma = \frac{L_\sigma(h_1 + h_2) - h_2}{s - \sigma^2} + 1 \tag{18}$$

where $L_\sigma$ and $\sigma$ are given by equations (17). Formula (18) can be evaluated
directly for every value of $\sigma$ except $\sigma = \sqrt{s}$, for which it can be
shown that

$$\bar{N}_{\sqrt{s}} = \frac{h_1 h_2}{2s^2} + 1 \tag{19}$$


This will ordinarily be close to the maximum average amount of inspection 
required to reach a decision. The ASN curve for the present example 
is included in Fig. 5.2. 
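
Points on the OC and ASN curves of the sequential plan can be generated with the following Python sketch. It rests on stated assumptions: the parametric relations coded here (the acceptance probability as a ratio of exponentials in t, and the variance as (1 − exp(−2st))/(2t)) are a form consistent with Wald's sequential theory for a test of a normal variance and are not guaranteed to be a verbatim copy of the printed formulas; the function names are hypothetical.

```python
import math

def sequential_constants(sigma1, sigma2, alpha, beta):
    """Intercepts h1, h2 and slope s of the decision lines."""
    d = 1.0 / sigma1 ** 2 - 1.0 / sigma2 ** 2
    h1 = 2.0 * math.log((1.0 - alpha) / beta) / d
    h2 = 2.0 * math.log((1.0 - beta) / alpha) / d
    s = math.log(sigma2 ** 2 / sigma1 ** 2) / d
    return h1, h2, s

def oc_asn_point(t, h1, h2, s):
    """Return (sigma, acceptance probability, average sample number) for the
    auxiliary parameter t; t = 0 is handled by its limiting values."""
    if t == 0.0:
        sigma_sq = s                                   # limiting variance, sigma = sqrt(s)
        accept = h2 / (h1 + h2)                        # limit of the exponential ratio
        asn = h1 * h2 / (2.0 * s * s) + 1.0
    else:
        e_all = math.exp(t * (h1 + h2))
        accept = (e_all - math.exp(t * h1)) / (e_all - 1.0)
        sigma_sq = (1.0 - math.exp(-2.0 * s * t)) / (2.0 * t)
        asn = (accept * (h1 + h2) - h2) / (s - sigma_sq) + 1.0
    return math.sqrt(sigma_sq), accept, asn

h1, h2, s = sequential_constants(1 / 18, 1 / 14, 0.01, 0.01)
for t in (64.0, 0.0, -64.0):
    sigma, accept, asn = oc_asn_point(t, h1, h2, s)
    print(round(sigma, 5), round(accept, 4), round(asn, 1))
```

With these constants, t = 64 and t = −64 reproduce the two specified points of the OC curve, and t = 0 gives the point at which the general ASN expression is indeterminate.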
Fig. 5.2.—OC and ASN curves for sequential-sampling plan based on standard
deviation; $\sigma_1 = 0.05556$, $\sigma_2 = 0.07143$, $\alpha = 0.01$, $\beta = 0.01$; $h_1 = 0.07180$, $h_2 = 0.07180$,
$s = 0.003927$. (Vertical axes: $L_\sigma$ and $\bar{N}_\sigma$.)

CHAPTER 6


VARIABILITY OF AMOUNT OF INSPECTION 
FOR DOUBLE, MULTIPLE, AND SEQUENTIAL SAMPLING 

1. SINGLE, DOUBLE, MULTIPLE, AND SEQUENTIAL SAMPLING 

Sampling inspection is said to be based on attributes if inspection 
of each item results in its classification as defective or nondefective, and 
disposition of the lot or process from which the sample comes depends 
upon the number of defectives found in the sample. One or more samples
are selected at random from the lot; if the total number of defectives
found in the first k samples does not exceed an acceptance number $A_k$,
the lot is accepted; if it equals or exceeds a rejection number $R_k$ ($R_k > A_k$),
the lot is rejected; if it falls between $A_k$ and $R_k$, another sample is
inspected. 

A single-sampling plan is one for which

$$R_1 = A_1 + 1 \tag{1}$$

A double-sampling plan is one for which

$$R_1 > A_1 + 1, \qquad R_2 = A_2 + 1 \tag{2}$$

A multiple-sampling plan is one for which

$$R_k > A_k + 1 \ \text{ for } k < K, \qquad R_K = A_K + 1 \ \text{ for some } K > 2 \tag{3}$$

In a sequential-sampling plan, the successive samples are all of size 1 and

$$R_k - A_k = C + \delta_k \tag{4}$$

where C is some constant and each $\delta_k$ is 0 or 1.*
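
Conditions (1) to (3) can be checked mechanically for a plan given as lists of acceptance and rejection numbers. The Python sketch below is an illustration; the function name is hypothetical, and the use of `None` to mark a stage at which acceptance is not permitted is a convention adopted here, not one taken from the text.

```python
def classify_plan(acceptance, rejection):
    """Classify a plan as 'single', 'double', or 'multiple' from its cumulative
    acceptance numbers A_k and rejection numbers R_k (one entry per sample).
    An acceptance entry of None means acceptance is not permitted at that stage."""
    K = len(acceptance)
    if K == 0 or K != len(rejection):
        raise ValueError("need one acceptance and one rejection number per sample")
    # Treat 'acceptance not permitted' as an acceptance number of -1.
    a = [-1 if x is None else x for x in acceptance]
    if rejection[-1] != a[-1] + 1:
        raise ValueError("the last sample must force a decision: R_K = A_K + 1")
    for k in range(K - 1):
        if rejection[k] <= a[k] + 1:
            raise ValueError("a decision is forced before the last sample")
    return {1: "single", 2: "double"}.get(K, "multiple")

print(classify_plan([1], [2]))
print(classify_plan([0, 2], [3, 3]))
print(classify_plan([None, None, 0, 1, 1, 1, 2], [2, 2, 2, 3, 3, 3, 3]))
```

The second and third calls use the acceptance and rejection numbers of the double and multiple plans listed in Table 6.1.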

* For an exposition of single, double, and multiple sampling, see Statistical 
Research Group, Sampling Inspection; what is here called “multiple sampling” is 
there called “sequential sampling,” and what is here called “sequential sampling” is 
there called “item-by-item sequential sampling.” For an exposition of sequential 
sampling see Chap. 17 of this volume, or Statistical Research Group, Sequential 
Analysis. (For full citations, see Sec. 5.) 

Equation (4) is a necessary condition for a sequential-sampling plan, but it is not
quite a sufficient condition unless further conditions are imposed on the $\delta_k$.

2. VARIABILITY OF AMOUNT OF INSPECTION

One of several factors determining the relative merits of single, 
double, multiple, and sequential sampling is the variability of the amount 
of inspection for plans giving equivalent operating-characteristic (OC) 
curves. Variability of the amount of inspection causes fluctuations 
in the inspection load and thus may create administrative problems in 
an inspection program. 

Variability may be interpreted in two ways. First, it may mean 
variation in the average number of items inspected per lot as the quality 
of submitted lots varies; in other words, the amplitude of the rise and 
fall of the average sample number (ASN) curve. Second, it may mean 
variation in the number of items inspected for lots of a given quality.

Clearly, single sampling will show less variability, in either sense, 
than double, multiple, or sequential sampling. In fact, with single 
sampling there will be no variability of either kind unless inspection is 
curtailed, that is, terminated as soon as the rejection number is exceeded 
or the number of items remaining uninspected is insufficient to cause 
rejection even if all are defective. 

No general relations among the variabilities of double, multiple, 
and sequential plans are known. The purpose of this chapter is to 
present two illustrative comparisons. These two comparisons suggest
by their differences that no simple conclusions about the relative variability
of the three types of sampling are valid. In the illustrations it is
assumed that the first sample of the double and multiple plans will be
inspected completely (this being usual* to facilitate estimation of the
process average), that inspection of later samples will be curtailed as
soon as the rejection number is reached, but that curtailing will not be
applied to accepted lots.

3. ILLUSTRATIONS 

Example I. Table 6.1 presents a set of double, multiple, and
sequential plans whose OC curves are nearly the same.† Figure 6.1
shows their average sample numbers $\bar{N}$ as functions of lot quality,
and Fig. 6.2 shows their standard deviations $\sigma(N)$ of amount of
inspection for lots of a given quality as functions of lot quality.

Example II. Table 6.2 presents a second set of plans with matching
OC curves;‡ Fig. 6.3 shows $\bar{N}$ as a function of lot quality, and
Fig. 6.4 shows $\sigma(N)$ as a function of lot quality.

* It is not, however, necessary. See Girshick, Mosteller, and Savage,
“Unbiased Estimates for Certain Binomial Sampling Problems with Applications.”

† The OC curves of the double and multiple plans are shown in Table 4 of Sampling
Inspection for sample size letter H, acceptable quality level (AQL) class 0.32-0.65
percent defective (p. 288).

‡ The OC curves of the double and multiple plans are shown in Table 4 of Sampling
Inspection for sample size letter K, AQL class 3.2-4.4 percent defective (p. 330).

TABLE 6.1

Double, Multiple, and Sequential Plans of Example I

DOUBLE SAMPLING

                            Combined samples
Sample      Sample size     Size     Acceptance number     Rejection number

First       50              50       0                     3
Second      100             150      2                     3

MULTIPLE SAMPLING

                            Combined samples
Sample      Sample size     Size     Acceptance number     Rejection number

First       20              20       *                     2
Second      20              40       *                     2
Third       20              60       0                     2
Fourth      20              80       1                     3
Fifth       20              100      1                     3
Sixth       20              120      1                     3
Seventh     20              140      2                     3

* Acceptance not permitted until three samples have been inspected.

SEQUENTIAL SAMPLING

$\alpha = 0.05$     $\beta = 0.10$     $p_1 = 0.005$     $p_2 = 0.05$

Fig. 6.1.—ASN curves for double, multiple, and sequential plans of Table 6.1.

Fig. 6.3.—ASN curves for double, multiple, and sequential plans of Table 6.2.
(Horizontal axis: percent defective in lot.)

Fig. 6.4.—$\sigma(N)$ curves for double, multiple, and sequential plans of Table 6.2.
(Horizontal axis: percent defective in lot.)

TABLE 6.2

Double, Multiple, and Sequential Plans of Example II

DOUBLE SAMPLING

                            Combined samples
Sample      Sample size     Size     Acceptance number     Rejection number

First       150             150      9                     24
Second      300             450      23                    24

MULTIPLE SAMPLING

                            Combined samples
Sample      Sample size     Size     Acceptance number     Rejection number

First       50              50       1                     6
Second      50              100      3                     9
Third       50              150      7                     13
Fourth      50              200      10                    16
Fifth       50              250      13                    19
Sixth       50              300      16                    22
Seventh     50              350      19                    25
Eighth      50              400      24                    25

SEQUENTIAL SAMPLING

$\alpha = 0.05$     $\beta = 0.10$     $p_1 = 0.0410$     $p_2 = 0.0920$


4. COMPUTATIONAL METHODS 

For double and multiple sampling, $\bar{N}$ and $\sigma(N)$ may be calculated
by an extension of methods described in Sampling Inspection, Chap. 17.
For sequential sampling, the computations for $\bar{N} = E(N)$ are based on
the formula derived in Sequential Analysis, Sec. 5.0921, and those for
$\sigma(N)$ on the relation

$$\sigma^2(N) = E(N^2) - [E(N)]^2 \tag{5}$$

where the formula for $E(N^2)$ may be obtained by an extension of the
method used in Sec. 5.0921 of Sequential Analysis for obtaining E(N).
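
Relation (5) can be illustrated for a double-sampling plan with a small Python sketch. This is a simplified illustration under assumptions that differ from the chapter's own: both samples are taken to be inspected completely (no curtailment of the second sample), and the number of defectives in the first sample is treated as binomial, which presumes a lot large relative to the sample. The function names are hypothetical.

```python
import math

def binomial_pmf(x, n, p):
    """Probability of exactly x defectives in a sample of n at fraction defective p."""
    return math.comb(n, x) * p ** x * (1.0 - p) ** (n - x)

def double_plan_inspection(n1, n2, a1, r1, p):
    """Mean and standard deviation of the number of items inspected under a
    double plan, assuming no curtailment and a binomial first sample.
    A second sample is taken when the first-sample count lies strictly
    between the acceptance number a1 and the rejection number r1."""
    p_second = sum(binomial_pmf(x, n1, p) for x in range(a1 + 1, r1))
    e_n = n1 + n2 * p_second                                         # E(N)
    e_n2 = n1 ** 2 * (1.0 - p_second) + (n1 + n2) ** 2 * p_second    # E(N^2)
    variance = e_n2 - e_n ** 2                                       # relation (5)
    return e_n, math.sqrt(max(variance, 0.0))

# Double plan of Table 6.1: first sample 50, second sample 100, A1 = 0, R1 = 3.
for percent in (0.5, 2.0, 5.0):
    mean, sd = double_plan_inspection(50, 100, 0, 3, percent / 100.0)
    print(percent, round(mean, 1), round(sd, 1))
```

Because curtailment is ignored, the figures produced by this sketch would not be expected to agree exactly with curves computed under the chapter's assumptions.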
CHAPTER 7


PLANNING AND ANALYZING EXPERIMENTS FOR COMPARING 
TWO PERCENTAGES 

1. TYPE OF PROBLEM 

The type of problem dealt with in this chapter may be indicated
by a simple example. Suppose that trials of two fire-control devices 
have resulted in the following data: 


TABLE 7.1

            Hits     Misses     Total

Old         3        197        200
New         4        196        200

Total       7        393        400


The question is whether the new method, which shows 33 percent more 
hits than the old, can be expected to yield more hits in the long run; or 
whether its superiority in this experiment ought to be regarded as mere 
chance, in the sense that two series of trials with the same method would 
often differ as much. 

We may look at the question this way: From 400 rounds, 7 hits 
were secured. If the two methods are really equally good in the long 
run, these 7 hits should be divided between the two series in proportion 
to the total numbers in the series, except for chance variations. In 
the present case we should expect the 7 hits to be divided equally between 
the two series; and similarly the 393 misses. As it happens, the division 
in Table 7.1 is as nearly equal as it possibly could be, since the entries 
are necessarily integers. It is therefore immediately obvious in this 
particular instance that the probability is one-half that, if the two 
methods were really equally good in the long run, a single experiment 
would show for the new method as much (or more) apparent superiority 
as is shown in Table 7.1. (If this is not obvious, it can be verified by 
the method described below for use with cases where the answer is not 
apparent.) 
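
The one-half probability can be confirmed by direct counting. The Python sketch below assumes that, with the marginal totals of Table 7.1 held fixed and the two methods equally good, the 7 hits fall among the 400 rounds as in sampling without replacement (a hypergeometric model); this is offered as one natural reading and is not necessarily the procedure the chapter goes on to describe. The function name is hypothetical.

```python
from math import comb

def prob_at_least(hits_new, total_hits, n_new, n_old):
    """Probability that the new method receives at least hits_new of the
    total_hits hits when hits are allotted at random among n_new + n_old rounds."""
    denominator = comb(n_new + n_old, total_hits)
    upper = min(total_hits, n_new)
    numerator = sum(comb(n_new, x) * comb(n_old, total_hits - x)
                    for x in range(hits_new, upper + 1))
    return numerator / denominator

print(prob_at_least(4, 7, 200, 200))   # prints: 0.5
```

The result is exactly one-half here because the two series are of equal length and the total number of hits is odd, so a 4:3 or better split in favor of the new method is as likely as a 3:4 or worse one.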

Data such as those of Table 7.1 represent a double dichotomy. 
The observations are dichotomized according to one criterion—that is, 
they are divided into two mutually exclusive and exhaustive classes—and 
then each class is dichotomized again according to a second criterion.
The result of doubly dichotomizing the data is, of course, the same
whichever criterion is applied first; that is, we may regard the data 
as classified first by method (old or new) and then by effect (hit or miss), 
or we may regard them as classified first by effect and then by method. 
The marginal totals of Table 7.1 represent two independent dichotomies 
of the data; they do not constitute a double dichotomy. The problem 
may be put in either of two ways: (1) Is the division between effects the 
same for both methods? (2) Is the division between methods the same 
for both effects? These questions are logically identical, though for a 
particular problem one of them usually seems more natural than the other. 
The problem is thus to test the homogeneity of a pair of dichotomies. 

2. METHOD OF ANALYSIS* 

2.1. Exact Method 

In order to illustrate the general procedure for testing the homogeneity 
of a double dichotomy such as that of Table 7.1, consider the 
data of Table 7.2, where the answer is not immediately apparent. 


TABLE 7.2

                 Successes    Failures    Total
Standard             2           198       200
Experimental         5           195       200
Total                7           393       400






For convenience and generality the two methods compared are here 
designated “standard” and “experimental” and the two possible outcomes 
of a trial with either method as “success” and “failure.” In practice, 
of course, the two methods may be equally experimental or equally 
standard, and the two possible outcomes may be two types of success or 
two types of failure or merely two different outcomes not properly 
regarded as either success or failure. 

It might appear, in view of the remarks in Sec. 1, that we could 
treat the 2:5 division of successes like a 2:5 division of heads and tails 
on 7 tosses of a coin, using the binomial distribution to find the probability 
of as many as 5 heads on 7 tosses if the coin were really unbiased. Table 
7.2, unlike Table 7.1, however, gives different answers for the successes 
and failures columns according to this treatment; so some other method of 
analysis is called for. (In this particular instance, however, it happens 
that analysis of the seven successes according to a binomial distribution 
gives an excellent approximation to the correct probability, because both 
proportions of successes are small; see below, Sec. 2.2, p. 255.) 

* See Fisher, Statistical Methods, Secs. 21, 21.01, and 21.02. (For full citations, 
see Sec. 6.) 

We proceed as follows. Let P designate the true long-run probability 
of a success, that is, the probability of a success in the population from 
which the sample comes. P, of course, is unknown, for we should have 
to carry out an infinitely long series of trials to measure it. But we 
make our calculations on the assumption that P is the same in both 
series, and, as we shall see, it cancels out. If, on the assumption that P 
is really the same in the long run for both series, we find that a difference 
as great as that shown in the data would be unlikely, then the fact that 
this unlikely difference has actually been observed experimentally leads 
us to conclude that the assumption was wrong—to conclude, in other 
words, that the long-run probabilities are not equal. 

The probability that 200 trials will show exactly 2 successes is, by 
the binomial distribution, 

$$\frac{200!}{2!\,198!}\,P^{2}(1-P)^{198} \qquad (1)$$

Similarly, the probability of exactly 5 successes is 

$$\frac{200!}{5!\,195!}\,P^{5}(1-P)^{195} \qquad (2)$$

So the probability that the two series of 200 trials will show exactly 2 
successes for the first series and exactly 5 for the second is the product of 
(1) and (2), 

$$\frac{200!\,200!}{2!\,5!\,195!\,198!}\,P^{7}(1-P)^{393} \qquad (3)$$

This is an absolute probability, whereas we require the conditional 
probability that if exactly 7 successes and 393 failures occur in the two 
series they will be divided between the series as shown in Table 7.2; 
hence (3) must be divided by the probability of exactly 7 successes in 
400 trials, 

$$\frac{400!}{7!\,393!}\,P^{7}(1-P)^{393} \qquad (4)$$

The probability that if we get 7 successes among two series of 200 each, 
they will be distributed 2 to the standard method and 5 to the experimental, 
is the ratio of (3) to (4), or 

$$\frac{7!\,393!\,200!\,200!}{400!\,2!\,5!\,195!\,198!} = 0.1636 \qquad (5)$$

It should be noted that expression (5) would be the same if expressions 
(1), (2), and (4) had been formed from the two columns instead of from 
the two rows of Table 7.2. 
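
The cancellation of P can be checked numerically. The following is a minimal sketch in Python; the function name and the trial values of P are illustrative choices, not part of the original account. It forms the ratio of expression (3) to expression (4) for several values of P and compares it with the value of expression (5), which does not involve P.

```python
from math import comb

def conditional_probability(p):
    """Ratio of expression (3) to expression (4) for a long-run probability p."""
    # Expression (3): exactly 2 successes in one series of 200 and 5 in the other.
    joint = (comb(200, 2) * p**2 * (1 - p)**198) * (comb(200, 5) * p**5 * (1 - p)**195)
    # Expression (4): exactly 7 successes in all 400 trials.
    total = comb(400, 7) * p**7 * (1 - p)**393
    return joint / total

# Expression (5): the same ratio after P has cancelled.
p_free = comb(200, 2) * comb(200, 5) / comb(400, 7)

for p in (0.005, 0.0175, 0.10, 0.50):
    print(p, round(conditional_probability(p), 4), round(p_free, 4))
```

Whatever value of P is tried, the two printed figures should agree, which is the sense in which the unknown P drops out of the analysis.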

In general, expression (5) is composed as follows: The numerator is 
the product of the factorials of the four marginal totals; the denominator 
is the product of the factorials of the four cell entries and the factorial 
of their sum. Thus, if we give Tables 7.1 and 7.2 the general representation 
shown in Table 7.3, 

TABLE 7.3

                 Successes    Failures    Total
Standard             a            b       a + b
Experimental         c            d       c + d
Total              a + c        b + d     a + b + c + d

expression (5) may be written 

$$\frac{(a+b)!\,(c+d)!\,(a+c)!\,(b+d)!}{a!\,b!\,c!\,d!\,(a+b+c+d)!} \qquad (6)$$
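
Expression (6) can be evaluated directly with exact integer arithmetic. The sketch below is one way to do so in Python; the function name is hypothetical, and the cell labels follow Table 7.3.

```python
from fractions import Fraction
from math import factorial

def table_probability(a, b, c, d):
    """Expression (6): conditional probability of a 2 x 2 table with cells a, b, c, d."""
    numerator = (factorial(a + b) * factorial(c + d)
                 * factorial(a + c) * factorial(b + d))
    denominator = (factorial(a) * factorial(b) * factorial(c) * factorial(d)
                   * factorial(a + b + c + d))
    return Fraction(numerator, denominator)

# Table 7.2: standard 2 successes and 198 failures; experimental 5 and 195.
print(float(table_probability(2, 198, 5, 195)))
```

Interchanging b and c (that is, forming the expression from columns instead of rows) leaves the result unchanged, since the formula is symmetric in the two classifications.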


Expression (5) gives the probability that if in fact the long-run 
probabilities were equal, an experiment involving 200 trials of each method 
and resulting in a total of 7 successes would show exactly the superiority 
for the experimental method that is shown by Table 7.2. For 
the final probability to use in judging whether there is a long-run difference, 
we must add to (5) the probabilities of superiorities greater than 
that of Table 7.2. Tables 7.4 and 7.5 show possible results that would 
have indicated even greater superiority for the experimental method than 
does Table 7.2. 

TABLE 7.4

                 Successes    Failures    Total
Standard             1           199       200
Experimental         6           194       200
Total                7           393       400

Expression (6) gives the probability for Table 7.4 as 

$$\frac{7!\,393!\,200!\,200!}{400!\,1!\,6!\,194!\,199!} = 0.0534 \qquad (7)$$


TABLE 7.5

                 Successes    Failures    Total
Standard             0           200       200
Experimental         7           193       200
Total                7           393       400




Expression (6) gives the probability for Table 7.5 as 

$$\frac{7!\,393!\,200!\,200!}{400!\,0!\,7!\,193!\,200!} = 0.0074 \qquad (8)$$

The sum of the three probabilities (5), (7), and (8), or 0.2244, is 
the probability that the experiment will show as much or more apparent 
superiority of the experimental method as Table 7.2 shows, if in fact 
there is no real difference. It should be noted that we add together the 
probabilities of only those cases representing as much or more superiority for the 
experimental method as was observed in the sample. We do not 
include the probabilities of cases representing greater superiority for the 
standard than was observed for the experimental method. This is 
because we are analyzing the data in order to decide whether the experimental 
is superior to the standard method, not simply to decide whether 
there is a difference between the methods without regard to its direction. 
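
The whole one-sided calculation of this section can be sketched as follows. This is an illustrative Python version under stated assumptions: the function names are hypothetical, the second row of the table is taken to be the experimental method, and expression (6) is written with binomial coefficients, which is algebraically the same ratio.

```python
from fractions import Fraction
from math import comb

def table_probability(a, b, c, d):
    """Expression (6), written with binomial coefficients."""
    return Fraction(comb(a + b, a) * comb(c + d, c), comb(a + b + c + d, a + c))

def exact_one_sided_probability(a, b, c, d):
    """Sum expression (6) over the observed table and every table with the same
    marginal totals that favors the experimental method (second row) still more."""
    total = Fraction(0)
    for shift in range(min(a, d) + 1):
        # Move `shift` successes from the standard row to the experimental row.
        total += table_probability(a - shift, b + shift, c + shift, d - shift)
    return total

print(float(exact_one_sided_probability(2, 198, 5, 195)))  # Table 7.2
print(float(exact_one_sided_probability(3, 197, 4, 196)))  # Table 7.1
```

For Table 7.2 the result agrees with the 0.2244 of the text to within the rounding of the individual terms, and for Table 7.1 it is exactly one-half, as asserted in Sec. 1.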

Finally, a conclusion is reached regarding the superiority of the 
experimental method as follows: We agree to accept the experimental 
method as superior to the standard on the basis of an analysis of the 
above type only when the resulting probability is less than α, where α is 
chosen in advance and is generally taken to be 0.10, 0.05, 0.01, or 0.001, 
depending upon the seriousness of accepting the experimental method as 
superior when it is not. Then α is the probability of accepting the 
experimental method as superior when actually there is no real difference 
between the methods. In terms of any of the usual values of α, the 
probability 0.2244 would imply that the data of Table 7.2 provide no 
basis for concluding that the experimental method is actually superior 
to the standard. 

2.2. Short-cut Methods 

The evaluation of the factorials involved in expressions (5), (7), and 
(8) was done by cancellation of common factors. This procedure is the 
simplest in this case, but in general it may be very laborious. The 
calculations can be considerably reduced by using a table of the logarithms 
of factorials.* 
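
A table of logarithms of factorials has a direct computational counterpart in the log-gamma function, since ln n! = lgamma(n + 1). The sketch below, with hypothetical function names, evaluates expression (6) by adding and subtracting such logarithms; it is offered as an illustration of the short cut, not as the procedure used for the figures in the text.

```python
from math import exp, lgamma

def log_factorial(n):
    """Natural logarithm of n!, obtained from the log-gamma function."""
    return lgamma(n + 1)

def table_probability_from_logs(a, b, c, d):
    """Expression (6) evaluated from logarithms of factorials."""
    log_p = (log_factorial(a + b) + log_factorial(c + d)
             + log_factorial(a + c) + log_factorial(b + d)
             - log_factorial(a) - log_factorial(b)
             - log_factorial(c) - log_factorial(d)
             - log_factorial(a + b + c + d))
    return exp(log_p)

# Tables 7.2, 7.4, and 7.5 in turn.
for cells in [(2, 198, 5, 195), (1, 199, 6, 194), (0, 200, 7, 193)]:
    print(cells, round(table_probability_from_logs(*cells), 4))
```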

A much simpler procedure, which usually yields very satisfactory 
approximations to the exact method described in Sec. 2.1, is available 
when at least two (and preferably five) successes would be expected in 
each cell if there were no difference in the long-run probabilities for 
the two methods. This expected number of successes is computed for any 
cell by dividing the product of the marginal totals for that cell by the 
total number of trials made; thus, the expected number of successes for 
the standard method is, on the basis of Table 7.3, 

$$\frac{(a+b)(a+c)}{a+b+c+d} \qquad (9)$$

* A convenient table is given in Fry, Probability and Its Engineering Uses. 
Shorter tables can be found in various handbooks. 

In such cases we calculate a quantity χ_c² as follows: 

First, adjust the table of observations so that each frequency is 
brought ½ unit nearer the frequency expected if there were no long-run 
difference between the two methods. This may be done by adjusting one 
frequency and then changing the others so that the marginal totals of 
the new table equal those of the old. For example, in Table 7.2 the 
expected frequency for the first cell is (7 × 200)/400 = 3½, so we increase 
the observed figure of 2 to 2½. Changing the other figures to keep 
the marginal totals fixed, we have 


TABLE 7.6

                 Successes    Failures    Total
Standard            2½          197½       200
Experimental        4½          195½       200
Total                7           393       400



Table 7.6 represents Table 7.2 “corrected for continuity.”* 

Second, each adjusted frequency is subtracted from the corresponding 
expected frequency, the differences are squared, the squares are divided 
by the corresponding expected frequencies, and the ratios are summed to 
obtain χ_c². 

The final value of χ_c² may be obtained directly without going 
through the intermediate steps, as follows: Calculate the products of 
frequencies diagonally opposite each other in the adjusted table; subtract 
one diagonal product from the other; square the difference; multiply the 
square by the sum of the four frequencies in the table; and divide this 
product by the product of all four marginal totals. From the data of 
Table 7.6, we have 

$$\chi_c^2 = \frac{(2\tfrac12 \times 195\tfrac12 - 4\tfrac12 \times 197\tfrac12)^2 \times 400}{7 \times 393 \times 200 \times 200} = 0.5816 \qquad (10)$$

More generally, if we replace a, b, c, and d in Table 7.3 by a', b', c', and d', 
representing the frequencies corrected for continuity, (10) becomes 

$$\chi_c^2 = \frac{(a'd' - b'c')^2\,(a+b+c+d)}{(a+b)(a+c)(c+d)(b+d)} \qquad (11)$$
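
The two routes to χ_c² described above (summing over the four cells, and the direct formula (11)) can be compared in a short sketch. The function names are hypothetical, and the adjustment routine assumes that the observed first cell differs from its expectation by at least ½.

```python
def corrected_frequencies(a, b, c, d):
    """Move each cell 1/2 unit toward its expected value, keeping marginal totals
    fixed. Assumes the observed a differs from its expectation by at least 1/2."""
    n = a + b + c + d
    expected_a = (a + b) * (a + c) / n          # expression (9)
    shift = 0.5 if a < expected_a else -0.5
    return a + shift, b - shift, c - shift, d + shift

def chi_square_by_cells(a, b, c, d):
    """Sum of (adjusted - expected)^2 / expected over the four cells."""
    n = a + b + c + d
    rows, cols = (a + b, c + d), (a + c, b + d)
    adjusted = corrected_frequencies(a, b, c, d)
    expected = [rows[i] * cols[j] / n for i in range(2) for j in range(2)]
    return sum((adj - e) ** 2 / e for adj, e in zip(adjusted, expected))

def chi_square_shortcut(a, b, c, d):
    """Formula (11), applied to the corrected frequencies."""
    n = a + b + c + d
    a1, b1, c1, d1 = corrected_frequencies(a, b, c, d)
    return (a1 * d1 - b1 * c1) ** 2 * n / ((a + b) * (a + c) * (c + d) * (b + d))

print(round(chi_square_by_cells(2, 198, 5, 195), 4))
print(round(chi_square_shortcut(2, 198, 5, 195), 4))
```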

* Although this account of the continuity correction describes the procedure 
accurately, it may convey an erroneous idea of the theoretical basis for the correction. 
The general procedure is to carry through the calculation of χ from the original data, 
then find the next lowest value of χ that could have arisen, and finally take χ_c as 
halfway between these two values. In the particular case of double dichotomies, this 
value of χ can be found most easily by correcting the original data as in Table 7.6; 
but in most other cases where the χ test is used, correction of the original data will 
lead to serious error in the value of χ. For an excellent analysis and presentation of 
this point, see Cochran, “The χ² Correction for Continuity,” especially p. 423. 

Next, we take the square root and find χ_c = 0.7626. This may 
be treated as a normal deviate; that is, the required probability is given by 

$$\frac{1}{\sqrt{2\pi}} \int_{\chi_c}^{\infty} e^{-x^2/2}\,dx \qquad (12)$$

Referring to a table of the normal probability integral,* for example 
Table 1.5 of Chap. 1, we find the probability to be 0.2228. This compares 
favorably with the exact probability of 0.2244 found in Sec. 2.1. 
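
In place of a printed table of the normal probability integral, the upper-tail area can be obtained from the Python standard library. This is a minimal sketch; the variable names are illustrative.

```python
from math import sqrt
from statistics import NormalDist

chi_c = sqrt(0.5816)                  # square root of the value found in (10)
tail = 1 - NormalDist().cdf(chi_c)    # upper-tail area, expression (12)
print(round(chi_c, 4), round(tail, 4))
```

The tail area obtained this way should agree with the tabled 0.2228 to about four decimal places; small discrepancies in the last digit are to be expected from table interpolation.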

More accurate values of χ_c corresponding to the probabilities 0.025 
and 0.005 than are implied by (12) are given in Table VIII, “Tests 
of Significance for 2 X 2 Contingency Tables,” in Fisher and Yates, 
Statistical Tables. Detailed instructions and illustrations for the use of 
this table are given in its introduction. 

When the percentage of successes is small and the number of trials 
in each series large, as is the case in the examples discussed here, the 
successes may be analyzed by the binomial distribution, along the lines 
discussed in Sec. 1 and the second paragraph of Sec. 2.1.† The value of 
P for the binomial test is the proportion of all trials that fall into, say, 
the standard series. In the illustration of Table 7.2, P = ½. According 
to the binomial distribution, the probability that seven trials of an 
event having two equally likely outcomes will show five or 
more of a designated type (in other words, the probability that with 
seven successes observed five or more will occur in the experimental 
series) is 

$$\frac{1}{2^{7}}\left(\frac{7!}{2!\,5!} + \frac{7!}{1!\,6!} + \frac{7!}{0!\,7!}\right) = 0.2266 \qquad (13)$$


This conforms satisfactorily to the exact value 0.2244 found in Sec. 2.1. 
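
The binomial short cut of expression (13) can be sketched as below. The function and argument names are hypothetical; `share` is the proportion of all trials falling in the series of interest, which is ½ when the two series are of equal length.

```python
from fractions import Fraction
from math import comb

def binomial_upper_tail(successes, at_least, share):
    """Probability that at least `at_least` of `successes` events fall in a series
    that receives each event independently with probability `share`."""
    return sum(comb(successes, k) * share**k * (1 - share)**(successes - k)
               for k in range(at_least, successes + 1))

# Table 7.2: 7 successes in all, 5 of them in the experimental series,
# which contains 200 of the 400 trials.
p = binomial_upper_tail(7, 5, Fraction(200, 400))
print(p, float(p))
```

With exact fractions the result is 29/128, which rounds to the 0.2266 of expression (13).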


2.3. Alternative Methods 

A procedure that has been widely used for testing the significance 
of the difference between two proportions is based on the following 
considerations: The difference between the two percentages is divided 
by the standard error of the difference, and the ratio is regarded as 
normally distributed. The standard error of the difference is computed 
from the familiar relation that the standard error of the difference 
between two independent statistical variables is the square root of the 
sum of the squared standard deviations of the two variables. The 
standard error of a proportion p is well known to be √(P(1 − P)/N), 
where P is the true population value of the proportion and N is the 
number of trials. 

* For a detailed table, see the tables prepared by the New York Mathematical 
Tables Project. 

† See Przyborowski and Wilenski, "Homogeneity of Results in Testing Samples 
from Poisson Series," especially Sec. 2, Derivation of Test, pp. 314-317. 

Applying these considerations, it is often concluded that 

$$V = \frac{\sqrt{N}\,(p_E - p_S)}{\sqrt{p_E(1-p_E) + p_S(1-p_S)}} \qquad (14)$$

where p_E is the observed proportion for the experimental method 
      p_S is the observed proportion for the standard method 
      N is the number of trials with each method 
may be treated as normally distributed with mean of 0 and standard 
deviation of 1. If the probability of obtaining a normal deviate as 
large as V is small, it is concluded that the difference in the proportions 
is greater than can be ascribed to chance. 

There are two serious objections to the procedure by which expression 
(14) is derived: (1) V will not actually be distributed as assumed; 
and (2) the procedure is not self-consistent. 

With regard to the distribution of V, there are two considerations: 
In the first place, the true population standard deviation of the difference 
is required in the denominator if the ratio is to be normally distributed 
with unit variance when N is large; but actually only experimental 
values are available. Hence, chance variations affect not only the 
numerator but also the denominator of V. In the second place, even 
if we know the true standard deviation, the distribution of the difference 
(p_E − p_S) cannot be satisfactorily approximated by a normal distribution 
when the proportions are near 0 or 1, unless N is very large. 

The procedure leading to expression (14) is inconsistent in two 
respects. It sets out to calculate the probability that, if there were 
actually no long-run difference, a difference as great as that observed 
would occur by chance. But if there were no long-run difference in the 
proportions, there would be none in the standard deviations. Therefore, 
when the standard deviation is estimated from the experiment, the estimate 
should be based on the two series together. This leads to 
considering 

$$\chi = \frac{\sqrt{N}\,(p_E - p_S)}{\sqrt{(p_E + p_S)\left[1 - \tfrac12 (p_E + p_S)\right]}} \qquad (15)$$

as a normally distributed variable. 

The second inconsistency in expression (14) may be seen as follows: 
It would be perfectly logical to find what proportion of the total successes 
were made with the standard method and what proportion of the 
total failures were made with the experimental method. These proportions 
should be equal if the standard and experimental methods are 
equally effective. But if the significance of the difference of this pair of 
proportions is tested by expression (14), the result is not the same as 
when the logically equivalent pair based on methods is tested. Expression 
(15), however, is consistent in this respect as well as in the standard 
error it uses. 
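
The inconsistency can be seen numerically with the data of Table 7.2. The sketch below is illustrative only: the function names are hypothetical, and the two ratios are written for unequal numbers of trials (an assumption made here so that the classification by effect, with 7 successes and 393 failures, can be treated in the same way); with equal numbers of trials they reduce to expressions (14) and (15).

```python
from math import sqrt

def unpooled_ratio(x1, n1, x2, n2):
    """Difference of two proportions over a standard error estimated separately
    for each proportion (the approach behind expression (14))."""
    p1, p2 = x1 / n1, x2 / n2
    return (p1 - p2) / sqrt(p1 * (1 - p1) / n1 + p2 * (1 - p2) / n2)

def pooled_ratio(x1, n1, x2, n2):
    """Same difference over a standard error estimated from both series together
    (the approach behind expression (15))."""
    p1, p2 = x1 / n1, x2 / n2
    pooled = (x1 + x2) / (n1 + n2)
    return (p1 - p2) / sqrt(pooled * (1 - pooled) * (1 / n1 + 1 / n2))

# Table 7.2 by method: successes out of 200 trials, experimental versus standard.
by_method = (5, 200, 2, 200)
# Table 7.2 by effect: experimental share of the 7 successes and of the 393 failures.
by_effect = (5, 7, 195, 393)

print(unpooled_ratio(*by_method), unpooled_ratio(*by_effect))
print(pooled_ratio(*by_method), pooled_ratio(*by_effect))
```

The unpooled ratio gives two different answers for the two logically equivalent classifications, whereas the pooled ratio gives the same answer for both.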

Expression (15) is mathematically identical with formula (11) 
in the previous section, except that formula (11) is calculated from the 
data corrected for continuity. Thus, in terms of the frequencies of 
Table 7.3 (without continuity corrections), expression (15) can be 
written as 

$$\chi = \frac{\sqrt{a+b+c+d}\,(ad - bc)}{\sqrt{(a+b)(a+c)(b+d)(c+d)}} \qquad (16)$$

with a + b = c + d = N. This is simply the square root of expression 
(11), neglecting the continuity corrections. When the continuity corrections 
are taken into account and N is introduced for simplification, 
expression (16) becomes 

$$\chi_c = \frac{ad - bc + N}{\sqrt{(N/2)(a+c)(b+d)}} \qquad (17)$$

Note that N in expressions (14), (15), (16), and (17) represents the 
number of trials in each set. Also note that (17) is valid only when the 
continuity corrections increase a and d and decrease b and c; for the 
reverse situation, +N in the numerator of (17) must be replaced by −N. 
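
Expressions (16) and (17) can be checked against the value χ_c = 0.7626 found in Sec. 2.2. The following sketch uses hypothetical function names; the rule for the sign of N is implemented as stated above, on the observation that the corrections increase a and d exactly when ad − bc is negative.

```python
from math import sqrt

def chi_uncorrected(a, b, c, d):
    """Expression (16)."""
    n = a + b + c + d
    return sqrt(n) * (a * d - b * c) / sqrt((a + b) * (a + c) * (b + d) * (c + d))

def chi_corrected(a, b, c, d):
    """Expression (17); requires a + b == c + d. The sign of N is chosen so that
    the correction brings ad - bc nearer to zero."""
    trials = a + b
    if trials != c + d:
        raise ValueError("expression (17) assumes equal numbers of trials")
    cross = a * d - b * c
    adjustment = trials if cross < 0 else -trials
    return (cross + adjustment) / sqrt((trials / 2) * (a + c) * (b + d))

print(round(chi_uncorrected(2, 198, 5, 195), 4))
print(round(chi_corrected(2, 198, 5, 195), 4))
```

For Table 7.2 both values are negative, because ad − bc is negative when the experimental method shows the higher proportion of successes; it is the absolute value that is referred to the normal table.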

Expression (14), though it is theoretically inferior to expressions (15) 
and (17), does not differ materially from them when N is large. Since 
(15) and (17) are theoretically superior and are little or no more trouble 
to calculate, they are always preferable to (14). Even (15) and (17), 
however, are approximations; they fail for very small values of N or 
very small probabilities of a success, and in such cases the procedures 
outlined above in Secs. 2.1 and 2.2 must be used. The procedure of Secs. 
2.1 and 2.2 should be used also when a + b is different from c + d. 

For practical work with double dichotomies, the Zubin Item Analyzer 
gives results essentially equivalent to those of expression (14).* In 
this device the difficulties that (1) proportions are not normally distributed 
when the true proportion is small, and (2) the standard deviation 
depends upon the true proportion, are avoided by employing a new 
variable θ, which is a function of p. The function is chosen so that, first, 
θ is more nearly normally distributed than p, and second, the standard 
deviation of θ is independent of the true long-run value of θ or p. A 
function that meets these requirements when N is not too small is 

$$\theta = 2 \arcsin \sqrt{p} \qquad (18)$$

Unless N is very small, θ may be regarded as normally distributed; the 
standard deviation is 1/√N when θ is measured in radians or 57.3/√N 
when θ is measured in degrees. The Zubin Analyzer is a mechanical 
device for transforming the proportions to the proper angles and reading 
directly the difference (θ_E − θ_S). The required probability can then 
be read from a simple chart showing the difference and its standard error, 
√(2/N).* 

* For a description of this device, see Fulcher and Zubin, “The Item Analyzer.” 
A model constructed by Dr. Fulcher was lent to the Statistical Research Group for 
trial in 1943. It proved quick and easy to use and would be cheap to build, as it 
involves only plywood and celluloid. 
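
The arithmetic that the analyzer performs mechanically can be sketched in a few lines. This is an illustration under stated assumptions, not a description of the device: the function names are hypothetical, angles are in radians, and no continuity correction is applied.

```python
from math import asin, sqrt
from statistics import NormalDist

def angular_value(p):
    """Expression (18): theta = 2 arcsin(sqrt(p)), in radians."""
    return 2 * asin(sqrt(p))

def angular_test(successes_e, successes_s, n):
    """Normal deviate and one-sided probability for theta_E - theta_S,
    with n trials for each method."""
    difference = angular_value(successes_e / n) - angular_value(successes_s / n)
    deviate = difference / sqrt(2 / n)    # standard error of the difference
    return deviate, 1 - NormalDist().cdf(deviate)

deviate, probability = angular_test(5, 2, 200)   # Table 7.2
print(round(deviate, 3), round(probability, 3))
```

Because the sketch omits a continuity correction, its probability is not directly comparable with the corrected value of Sec. 2.2.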


3. NECESSARY NUMBER OF TRIALS 

A method that is sometimes recommended for finding the required 
sample size in experiments involving double dichotomies is to set V in 
expression (14) equal to the value V_α, which would be exceeded with 
probability α if there were no true differences in the proportions, 
and then solve for N, using values of p_S and p_E obtained experimentally. 
The result is 

$$N = \frac{V_\alpha^2\,\left[\,p_E(1-p_E) + p_S(1-p_S)\,\right]}{(p_E - p_S)^2} \qquad (19)$$

This procedure can be made logically consistent by basing the calculations 
on expression (15) instead of on (14), in which case the formula 
becomes 

$$N = \frac{K_\alpha^2\,(p_E + p_S)\left[1 - \tfrac12 (p_E + p_S)\right]}{(p_E - p_S)^2} \qquad (20)$$

where K_α is the normal deviate exceeded with probability α, as given in 
Table 1.1 of Chap. 1.† 

In general, (20) solves correctly the following problem: If the proportion 
of successes is p_S for the standard and p_E for the experimental 
method, how large must be the number of trials from which these proportions 
were computed before the experiment can be regarded as indicative 
of a real difference? The question stated is, however, of little or no 
practical significance. The practical problem exists only because it 
cannot be assumed that the proportions shown in the experiment are 
the true proportions. If they were, we should be able to decide without 
difficulty which method is superior. Furthermore, even if we were 
willing to accept the proportions p_S and p_E shown in the experiment as 
suitable approximations of the true long-run proportions P_S and P_E, we 
could compute N from (20) only after carrying out the experiment to 
find p_S and p_E. 

* See Chap. 16 for an account of the inverse sine transformation for percentages. 

† The value of K_α² can be found from tables of the chi-square distribution, for 
example, Table IV in Fisher and Yates, Statistical Tables, thus eliminating the need of 
squaring K_α. Since the probability that χ² will exceed a certain level is the probability 
that χ will exceed the square root of that level plus the probability that −χ will exceed 
the square root of that level, the probabilities given in the χ² tables must be halved 
for the present purposes. 

To determine a useful formula for N, we must think of formula 
(11) or its equivalent, formula (15), as constituting a rule according 
to which we decide from the data which of two actions to take. If the 
formula yields a value above some predetermined critical level, we accept 
the experimental method as superior, and if it yields a lower value we do 
not accept it (that is, for practical purposes we reject it). If the true 
proportion of successes for the experimental method is the same as the 
true proportion of successes for the standard method, it is important 
that the critical level of χ be such that there is only a small probability, 
say α, that the data will lead to a value of χ above the critical level. 
Similarly, if the true proportion of successes is P_S for the standard 
method but P_E (P_E > P_S) for the experimental method, it is important 
that the critical level of χ and the sample size N be such that there is 
only a small probability, say β, that the data will lead to a value of χ 
below the critical level. 

The first condition, that the probability be only α of accepting the 
experimental method as superior when the true probabilities of success 
are equal, is met simply by selecting the critical level of χ as K_α, the 
normal deviate exceeded with probability α. Then, regardless of the 
sample size, the probability is α that the experimental method will be 
accepted when the true probabilities of success are the same for both 
methods. 

On the other hand, for small samples the probability that χ will 
exceed K_α will not be much more than α even when the true probabilities 
are P_S and P_E. This probability of accepting the experimental method 
when the true probabilities are P_S and P_E will, however, increase as the 
sample size increases; and the problem is to find that value of N for which 
it is 1 − β. 

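The required value of N, the size of the sample to be used with 
each method, is given by 

$$N = \frac{1}{2}\left(\frac{K_\alpha + K_\beta}{\arcsin\sqrt{P_E} - \arcsin\sqrt{P_S}}\right)^{2} \qquad (21)$$

if angles are measured in radians, or by 

$$N = 1641.6\left(\frac{K_\alpha + K_\beta}{\arcsin\sqrt{P_E} - \arcsin\sqrt{P_S}}\right)^{2} \qquad (22)$$

if angles are measured in degrees, where K_ε is the normal deviate exceeded 
with probability ε, defined by 

$$\frac{1}{\sqrt{2\pi}} \int_{K_\epsilon}^{\infty} e^{-x^2/2}\,dx = \epsilon \qquad (23)$$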

α is the probability of an error of the “first kind,” that is, of accepting 
the experimental method as superior when actually the probability 
of success is the same for both methods 
β is the probability of an error of the “second kind,” that is, of 
rejecting the experimental method when the true probabilities 
for the standard and experimental methods are P_S and P_E, 
respectively.* 

Values of N can be read directly from the nomogram of Fig. 7.1. 

The choice of P_S can usually be based upon previous experience with 
the standard method, which will indicate approximately its proportion 
of successes. The value of P_E may then be chosen enough above P_S so 
that the difference is one that it is important to detect when present. 
Moderate variations in the values of P_S and P_E are not serious if P_S and P_E 
are affected alike. If the angular difference, the square of which constitutes 
the denominator of formula (22), is accepted as the appropriate 
measure of the difference between two percentages, it is not necessary 
to choose P_S and P_E individually but only to determine the angular 
difference that should be detected.† 

To illustrate the application of formula (21), suppose P_S = 0.010, 
P_E = 0.025, α = 0.05, and β = 0.10. From Table 1.1 of Chap. 1, we 
find K_α = 1.645 and K_β = 1.282. From trigonometric tables,‡ we find 
arcsin 0.1000 and arcsin 0.1581 to be, respectively, 0.1002 and 0.1588 
radians. Substituting these figures into (21), we have 

$$N = \frac{(1.645 + 1.282)^2}{2\,(0.1588 - 0.1002)^2} = 1,246 \qquad (24)$$

* If only one method is involved in an experiment, and if it is desired that the 
probability of approving it be α or 1 − β according as its true probability of success is 
P_S or P_E, formula (21) will indicate twice the appropriate sample size; that is, the 
appropriate sample size is given by substituting 4 for 2 in the denominator of formula 
(21). This was the method of computing the sample sizes N_a given in Table 1.3 of 
Chap. 1 for acceptance-inspection plans based on attributes. For that application, 
the first sentence of this footnote should be changed to read as follows: If only one 
product is involved in an inspection plan, and if it is desired that the probability of 
rejecting be α or 1 − β according as the true probability of a defective is p_1 or p_2, 
formula (21) will indicate twice the appropriate sample size. 

† In this connection, see Statistical Research Group, Sequential Analysis, Sec. 
3.62, pp. 3.47-3.50, especially Table 3.61 and Fig. 3.61; also Secs. 3.21 and 3.22, pp. 
3.09-3.10. 

‡ For example, Mathematical Tables Project, Arc sin x. 

If the nomogram is used, we pass a line between 0.05 on the α scale and 
0.10 on the β scale and mark its point of intersection with the A scale. 
Another line is passed between 0.01 on the P_S scale and 0.025 on the P_E 
scale, and the point of intersection of this line with the B scale is marked. 
The N scale is calibrated so that the number of trials needed for each 
method can be read off where the N scale is intersected by a straight 
line between the points marked on the A and B scales. In this example, 
the point of intersection falls about halfway between N = 1,200 and 
N = 1,300. 
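
Formula (21) is also easily evaluated without tables or nomogram. The sketch below is a hypothetical Python rendering with angles in radians; the normal deviates are obtained from the standard library instead of from Table 1.1, so the result may differ from (24) by a unit or so through rounding.

```python
from math import asin, ceil, sqrt
from statistics import NormalDist

def required_trials(p_s, p_e, alpha, beta):
    """Formula (21), angles in radians: trials needed with each method."""
    k_alpha = NormalDist().inv_cdf(1 - alpha)   # deviate exceeded with probability alpha
    k_beta = NormalDist().inv_cdf(1 - beta)     # deviate exceeded with probability beta
    angular_difference = asin(sqrt(p_e)) - asin(sqrt(p_s))
    return 0.5 * ((k_alpha + k_beta) / angular_difference) ** 2

n = required_trials(0.010, 0.025, alpha=0.05, beta=0.10)
print(round(n), ceil(n))
```

In practice the value would be rounded up to the next whole number of trials.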

4. OPERATING CHARACTERISTICS 

Conclusions drawn from double dichotomies depend upon two quantities, 
one characterizing the amount of data collated and the other 
characterizing the method of analyzing the data. These quantities 
are N, the number of observations with each method, and K_α, the critical 
level of χ which, if equaled or exceeded, leads to acceptance of the 
experimental method as superior. Formula (21) can be used to determine 
the operating characteristics of any plan for analyzing a double 
dichotomy defined by given values of N and K_α. 

By the operating characteristics of a plan is meant the probability, 
as a function of the true nature of the populations, that the plan will 
lead to accepting the experimental method as superior. In this case, the 
operating characteristics may be represented by a surface rather than a 
curve, for the true nature of the populations is measured by two parameters 
P_S and P_E. If in the plane of the page we measure P_E lengthwise 
and P_S crosswise, any point for which P_E and P_S are both between 0 and 1 
represents a possible combination of P_E and P_S. Vertically above each 
point we may erect a line whose height represents the probability that a 
test using the given N and K_α will lead to accepting the experimental 
method as superior. The surface formed by the tops of these lines is the 
OC surface. A test is satisfactory if its OC surface shows high probabilities 
of accepting the experimental method when its superiority is sufficient 
to make such acceptance important, and low probabilities of accepting 
the experimental when its superiority is so small (especially when it 
is zero or negative) as to make its acceptance undesirable. 

We may rewrite (21) as 

$$-K_\beta = K_\alpha - \sqrt{2N}\,\left(\arcsin\sqrt{P_E} - \arcsin\sqrt{P_S}\right) \qquad (25)$$

Formula (25) yields −K_β; the probability of accepting the experimental 
method is 1 − β, which is the probability that a normal deviate will 
exceed −K_β and may be obtained from Table 1.5 of Chap. 1. Thus, for 
any combination of P_E and P_S, formula (25) yields the probability that a 
test based on N and K_α will lead to acceptance of the experimental 
method. 




Formula (25) does not depend upon the individual probabilities
$P_E$ and $P_S$ but only upon their angular difference

$$T = \arcsin\sqrt{P_E} - \arcsin\sqrt{P_S} \tag{26}$$

so we may write it

$$K_\beta = \sqrt{2N}\,T - K_\alpha \tag{27}$$

Letting $L_T$ denote the probability that a test defined by $N$ and $K_\alpha$ will
lead to acceptance of the experimental method when the true angular
difference is $T$, and noting that $K_{1-\beta} = -K_\beta$, we have

$$L_T = \frac{1}{\sqrt{2\pi}} \int_{K_\alpha - \sqrt{2N}\,T}^{\infty} e^{-x^2/2}\,dx \tag{28}$$


as the equation of the OC curve in terms of $T$. Values of $L_T$ can be
obtained from Table 1.5, Chap. 1, by taking

$$K_T = K_\alpha - \sqrt{2N}\,T \tag{29}$$

Figure 7.1 can be used to determine $L_T$ from (28). The given values
of $P_E$ and $P_S$ determine a point on the B scale. A line is drawn through
the point on the B scale and the given value of $N$ on the N scale and
extended until it intersects the A scale. This point on the A scale and
the given value of $\alpha$ on the $\alpha$ scale determine a line whose intersection
with the $\beta$ scale gives $1 - L_T$.

In evaluating a proposed test, it is sometimes helpful to plot the OC
curve against the true proportion of successes for the experimental
method, assuming some value, say $P'_S$, for the proportion of successes with
the standard method. This gives a picture, the significance of which
depends on the validity of the value used for $P'_S$, of the absolute level
of performance by the experimental method that will result in a high
probability of its acceptance or rejection. Such a picture can be obtained
from formula (25), replacing $P_S$ by $P'_S$ and calculating $L_T = 1 - \beta$
for various values of $P_E$.

Figure 7.2 shows the OC curve for an analysis based on a sample size
$N$ of 500 for each method and a critical level of $x$, $K_\alpha$, of 1.645. Two
horizontal scales are shown, the lower in terms of $T$, the true angular
difference between the two probabilities, and the upper in terms of $P_E$
on the assumption that $P_S = 0.10$.
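
The OC curve described above can be traced numerically from formulas (26) and (28). The following Python sketch is an illustration only: the function names are hypothetical, the arcsine is taken in radians, and the plan ($N = 500$, $K_\alpha = 1.645$, $P_S = 0.10$) is the one quoted for Figure 7.2. The trial values of $P_E$ are chosen arbitrarily.

```python
import math


def normal_upper_tail(z):
    """Probability that a standard normal deviate exceeds z."""
    return 0.5 * math.erfc(z / math.sqrt(2.0))


def angular_difference(p_e, p_s):
    """T = arcsin(sqrt(P_E)) - arcsin(sqrt(P_S)), in radians, as in (26)."""
    return math.asin(math.sqrt(p_e)) - math.asin(math.sqrt(p_s))


def oc_probability(p_e, p_s, n, k_alpha):
    """L_T of (28): probability that the plan accepts the experimental method."""
    t = angular_difference(p_e, p_s)
    return normal_upper_tail(k_alpha - math.sqrt(2.0 * n) * t)


if __name__ == "__main__":
    n, k_alpha, p_s = 500, 1.645, 0.10  # plan quoted for Figure 7.2
    for p_e in (0.10, 0.12, 0.14, 0.16, 0.18):  # arbitrary trial values
        l_t = oc_probability(p_e, p_s, n, k_alpha)
        print(f"P_E = {p_e:.2f}   L_T = {l_t:.3f}")
```

When $P_E = P_S$ the angular difference is zero and the computed probability reduces to the level of significance, about 0.05 for $K_\alpha = 1.645$.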


5. TECHNICAL NOTES 

Let $s_E$ and $s_S$ be the observed numbers of successes in $N$ trials with
the experimental and standard methods, respectively. Then the
observed proportions of successes with the two methods are

$$p_E = \frac{s_E}{N} \quad\text{and}\quad p_S = \frac{s_S}{N} \tag{30}$$


When we are interested only in alternative hypotheses for which
$P_E$, the true proportion of successes with the experimental method, is
greater than $P_S$, the long-run proportion of successes with the standard
method, the classical test with level of significance $\alpha$ is to reject the null
hypothesis that $P_E = P_S$ if

$$\frac{\sqrt{N}\,(p_E - p_S)}{\sqrt{(p_E + p_S)\left(1 - \dfrac{p_E + p_S}{2}\right)}} > K_\alpha \tag{31}$$

where

$$\frac{1}{\sqrt{2\pi}} \int_{K_\alpha}^{\infty} e^{-x^2/2}\,dx = \alpha \tag{32}$$


Except when $N$ is very small, or $P_E$ or $P_S$ very close to 0 or 1, this
test is essentially equivalent to the criterion: Reject the null hypothesis if*

$$\sqrt{2N}\,(\arcsin\sqrt{p_E} - \arcsin\sqrt{p_S}) > K_\alpha \tag{33}$$

because the quantity

$$t = \arcsin\sqrt{p_E} - \arcsin\sqrt{p_S} \tag{34}$$

may be regarded as normally distributed with mean

$$T = \arcsin\sqrt{P_E} - \arcsin\sqrt{P_S} \tag{35}$$

and variance $1/(2N)$, whence it follows that when the null hypothesis is
true, $\sqrt{2N}\,t$ is normally distributed about zero with unit standard
deviation.

* See Chap. 16 for a discussion of the inverse sine transformation of percentages,
upon which what follows is based.

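Now, to find $N$ so that the probability is $\beta$ of accepting the null
hypothesis when $T = T'$, it is necessary that

$$\text{Prob}\,[\sqrt{2N}\,(\arcsin\sqrt{p_E} - \arcsin\sqrt{p_S}) < K_\alpha \mid T'] = \beta \tag{36}$$

This is equivalent to requiring that

$$\frac{1}{\sqrt{2\pi}} \int_{-\infty}^{K_\alpha - \sqrt{2N}\,T'} e^{-x^2/2}\,dx = \beta \tag{37}$$

which becomes

$$K_\alpha - \sqrt{2N}\,T' = -K_\beta \tag{38}$$

where $K_\beta$ is defined by replacing $\alpha$ by $\beta$ in equation (32). Solving for $N$
gives

$$N = \frac{1}{2}\left(\frac{K_\alpha + K_\beta}{T'}\right)^2 \tag{39}$$

or

$$N = \frac{1}{2}\left(\frac{K_\alpha + K_\beta}{\arcsin\sqrt{P_E} - \arcsin\sqrt{P_S}}\right)^2 \tag{40}$$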
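
Formula (40) is easily evaluated by machine. The sketch below is illustrative only: the function names are hypothetical, the normal deviates $K_\alpha$ and $K_\beta$ are taken from Python's `statistics.NormalDist` rather than from Table 1.5, and the trial proportions and risks are invented for the illustration.

```python
import math
from statistics import NormalDist


def normal_deviate(eps):
    """K_eps: the normal deviate exceeded with probability eps, as in (32)."""
    return NormalDist().inv_cdf(1.0 - eps)


def sample_size(p_e, p_s, alpha, beta):
    """N from formula (40); the arcsine is taken in radians."""
    t = math.asin(math.sqrt(p_e)) - math.asin(math.sqrt(p_s))
    return 0.5 * ((normal_deviate(alpha) + normal_deviate(beta)) / t) ** 2


if __name__ == "__main__":
    # Hypothetical inputs, not taken from the text.
    n = sample_size(p_e=0.15, p_s=0.10, alpha=0.05, beta=0.10)
    print(f"N = {n:.1f}; use {math.ceil(n)} observations with each method")
```

Rounding $N$ upward keeps the probability of accepting the null hypothesis at the stated alternative no greater than $\beta$.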


It may be observed that formula (40) follows immediately from
formula (148) in Chap. 1, Sec. 11.1.3. We have

$$Z_1 = 0 \tag{41}$$

$$Z_2 = \arcsin\sqrt{P_E} - \arcsin\sqrt{P_S} \tag{42}$$

$$\sigma = 1 \tag{43}$$

$$F(N) = \frac{1}{\sqrt{2N}} \tag{44}$$

Substituting into equation (148) of Chap. 1, we have

$$\frac{1}{\sqrt{2N}} = \frac{\arcsin\sqrt{P_E} - \arcsin\sqrt{P_S}}{K_\alpha + K_\beta} \tag{45}$$

which, when solved for $N$, leads to (40). Similarly, equation (147) of
Chap. 1 leads to a critical level $A$ of $K_\alpha/\sqrt{2N}$ for
$\arcsin\sqrt{p_E} - \arcsin\sqrt{p_S}$, which is equivalent to (33).

CHAPTER 8


PLANNING AND INTERPRETING EXPERIMENTS 
FOR COMPARING TWO STANDARD DEVIATIONS 

1. INTRODUCTION 

The present chapter is concerned with the planning and interpretation 
of experiments to determine whether (1) a given product, or process, 
is more (or less) variable than a standard product, or process; (2) one 
of two new products, or processes, is more (or less) variable than the 
other; (3) the random variation of a product, or process, has a component 
assignable to a specific source of variation. 

With regard to proposed experiments, the problem considered is 
that of determining how many observations will be required in order to 
render the discriminating power of the experiment adequate for the 
purposes at hand. The solutions to this problem, in addition to being 
useful in the planning of experiments, provide a means of evaluating the 
discriminating power of an experiment already carried out. Thus, in 
planning experiments for the three purposes listed above, the results 
given below will enable a research worker to formulate an experiment the 
size of which, on the one hand, is compatible with the cost (monetary 
expense, time, inconvenience, and so forth) of obtaining the observa¬ 
tions and, on the other, limits to a known extent the risks of making 
erroneous decisions of practical significance. Alternatively, in the case of 
experiments already conducted, the results given below will enable the 
experimenter to judge the confidence that he can place in conclusions 
based on the data obtained. 

2. EXPERIMENTS TO DETERMINE WHETHER A NEW PRODUCT 
IS MORE VARIABLE THAN A STANDARD PRODUCT 

2.1. Problem 

The following situation is common in practical work: A certain product
has been in use for some time and its variability with respect to some
important measurable characteristic $x$ is known. Let $\sigma_0^2$ denote the
known variance (squared standard deviation) of $x$ for this standard
product. A new product is under consideration, whose variability with
respect to $x$ is unknown; let $\sigma_1^2$ denote the unknown variance of $x$ for the
new product. Since the value of $\sigma_1^2$ is unknown, it is not possible to
decide with certainty whether the variability of the new product is more
than that of the standard, that is, whether $\sigma_1^2/\sigma_0^2 > 1$.

2.2. General Procedure for Basing an Answer on Observational Data 

An answer to this question can, however, be obtained by computing
an estimate $s_1^2$ of $\sigma_1^2$ from values of $x$ observed in a sample of the new
product and agreeing that $s_1^2/\sigma_0^2 > C$ will be taken to imply that
$\sigma_1^2/\sigma_0^2 > 1$, and $s_1^2/\sigma_0^2 \le C$ will be taken to imply that
$\sigma_1^2/\sigma_0^2 \le 1$, where $C$ is some positive constant chosen in advance,
that is, before the value of $s_1^2$ has been obtained, preferably before the
experiment determining the value of $s_1^2$ is carried out. It is evident that
if $C$ is very large, application of this procedure will almost always lead
to the decision that the new product is not more variable than the
standard ($\sigma_1^2 \le \sigma_0^2$); if $C$ is very small, the decision will almost
always be that the new is more variable than the standard
($\sigma_1^2 > \sigma_0^2$). Clearly, some intermediate value of $C$ is called for: a
value that will not lead too frequently to the decision $\sigma_1^2/\sigma_0^2 > 1$
when the reverse is true and yet will lead to this decision in most of the
cases in which the excess of $\sigma_1^2$ over $\sigma_0^2$ is of practical importance.


2.3. Standard Procedure* 

The standard procedure is to use as an estimate of $\sigma_1^2$ a function $s_1^2$
of the observed values of $x$ that has the property that in repeated sampling
$n_1 s_1^2/\sigma_1^2$ is distributed, at least to a good approximation, as $\chi^2$ for
$n_1$ degrees of freedom, and to

a. Conclude $\sigma_1^2/\sigma_0^2 \le 1$ whenever

$$\frac{s_1^2}{\sigma_0^2} \le \frac{\chi^2_\alpha(n_1)}{n_1} \tag{1}$$

b. Conclude $\sigma_1^2/\sigma_0^2 > 1$ whenever

$$\frac{s_1^2}{\sigma_0^2} > \frac{\chi^2_\alpha(n_1)}{n_1} \tag{2}$$

where $\chi^2_\alpha(n_1)$ is the $\alpha$ level of significance of $\chi^2$ for $n_1$ degrees of
freedom, that is

$$\alpha = P\{\chi^2(n_1) > \chi^2_\alpha(n_1)\} \tag{3}$$

for $n = n_1$.†



* See Fisher, Statistical Methods, Chap. IV, Sec. 21.03. (For full citations, see
Sec. 7.)

† See Sec. 6, below, for a discussion of published tables of this integral.

Example I. The variance $\sigma_0^2$ of a particular dimension of a
standard product is 3.2 cm.² From measurements of the dimension
on 10 items of a new product it is found that $s_1^2 = 4.8$, with 9 degrees
of freedom. Does the variance $\sigma_1^2$ of the dimension with the new
product exceed the variance of the dimension with the standard
product?

If we conduct the test at the 0.05 level of significance, we find
$s_1^2/\sigma_0^2 = 1.5 < 1.880 = \chi^2_{0.05}(9)/9$ and are led to conclude
$\sigma_1^2/\sigma_0^2 \le 1$; that is, we conclude that the variability of the new
product does not exceed the standard in respect of the dimension in question.
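
The arithmetic of Example I can be checked with a few lines of Python. This is a sketch under stated assumptions: the function name is hypothetical, and the 5 per cent point of $\chi^2$ for 9 degrees of freedom (16.919) is hard-coded from a standard table rather than computed.

```python
# Upper 5 per cent point of chi-square for 9 degrees of freedom, taken from a
# standard table (an assumption of this sketch, not computed here).
CHI2_05_9 = 16.919


def standard_test(s1_sq, sigma0_sq, n1, chi2_alpha):
    """Apply inequalities (1) and (2).

    Returns the observed ratio s_1^2 / sigma_0^2, the critical ratio
    chi2_alpha / n1, and True when the rule concludes that the new product
    is more variable than the standard.
    """
    ratio = s1_sq / sigma0_sq
    critical = chi2_alpha / n1
    return ratio, critical, ratio > critical


if __name__ == "__main__":
    ratio, critical, more_variable = standard_test(4.8, 3.2, 9, CHI2_05_9)
    print(f"observed ratio = {ratio:.3f}, critical ratio = {critical:.3f}")
    print("conclude ratio > 1" if more_variable else "conclude ratio <= 1")
```

With the data of Example I the observed ratio falls below the critical ratio, so the rule leads to the conclusion that the new product is not more variable.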

2.4. Operating Characteristics of the Standard Procedure 

The question arises, what are the chances of the standard rule leading
to a wrong conclusion? That is, when in fact $\sigma_1^2/\sigma_0^2 \le 1$, how often
will it lead to the conclusion $\sigma_1^2/\sigma_0^2 > 1$, termed an error of the first
kind;* and when in fact $\sigma_1^2/\sigma_0^2 > 1$, how often will it lead to the
conclusion $\sigma_1^2/\sigma_0^2 \le 1$, termed an error of the second kind? In other
words, what are the operating characteristics of the procedure?

Clearly, the chance of an erroneous decision will depend on $\sigma_0^2$,
$n_1$, and $\alpha$, as these quantities are involved explicitly in inequalities (1)
and (2). Intuition tells us that the chance of error depends also on
the actual value of $\sigma_1^2/\sigma_0^2$, that is, upon

$$\rho = \frac{\sigma_1^2}{\sigma_0^2} \tag{4}$$

To begin with, it is clear from (3) that the probability of an error of
the first kind is exactly $\alpha$ when $\rho = 1$. Second, it will be shown in
Sec. 6 that the probability of an error of the second kind will be
exactly $\beta$ if $\rho$ has the value

$$\rho(\alpha,\beta,n_1) = \frac{\chi^2_\alpha(n_1)}{\chi^2_{1-\beta}(n_1)} \tag{5}$$

This equation has been used to compute tables of $\rho(\alpha,\beta,n_1)$ corresponding
to $\alpha = 0.05$ and 0.01 and to a wide selection of values of $\beta$ and $n_1$. These
are published as Tables 8.1 and 8.2, pp. 272-273 and 276-277; the method
of their construction and the accuracy of interpolation are discussed in
Sec. 6.

* This terminology was introduced by Neyman and Pearson.

The following examples illustrate the use of Tables 8.1 and 8.2 in
the interpretation of the results of experiments already carried out
and in the planning of experiments:

Example II. For what value of the ratio $\sigma_1^2/\sigma_0^2$ is the probability
of an error of the second kind equal to 0.50 when the test of Example
I is employed?

For $\alpha = 0.05$, $n_1 = 9$, and $\beta = 0.50$, we find from Table 8.1
that $\rho(0.05,0.50,9) = 2.0280$. This means that if $\sigma_1^2$ is roughly
double $\sigma_0^2$ (that is, if the standard deviation of the new product is
roughly 1.4 times that of the standard) and the test is conducted at
the 0.05 level of significance with $s_1^2$ based on only 9 degrees of
freedom, then there is approximately an even chance that the test
will lead to the wrong conclusion, $\sigma_1^2/\sigma_0^2 \le 1$. If, to ensure a smaller
risk of committing an error of the first kind, the test is conducted
at the 0.01 level of significance, the situation is worse, for

$$\rho(0.01,0.50,9) = 2.5970$$

(from Table 8.2). Clearly 9 degrees of freedom for $s_1^2$ are too few
if a decision that $\sigma_1^2/\sigma_0^2 \le 1$ is a serious error from the practical
viewpoint when $\sigma_1^2$ is double $\sigma_0^2$.

Footnote to Table 8.1: The probability is $\beta$ that the decision
$\sigma_1^2/\sigma_0^2 \le 1$ will be made when
$\sigma_1^2/\sigma_0^2 = \rho(0.05,\beta,n_1) = \chi^2_{0.05}(n_1)/\chi^2_{1-\beta}(n_1)$.

Fig. 8.1. Operating-characteristic curves for experiments to determine whether a
new product (process) is more variable than a standard product (process) of known
variability, 0.05 level of significance.
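
Entries such as those quoted in Example II can be reproduced directly from formula (5). The following Python sketch is only one possible way of doing so and is not the method by which the tables were constructed (that method is described in Sec. 6). The function names are hypothetical; the percentage points of $\chi^2$ are found by bisection on the upper-tail probability, which is evaluated through the regularized incomplete gamma function (series and continued-fraction forms).

```python
import math


def _gamma_p(a, x):
    """Regularized lower incomplete gamma function P(a, x), for a > 0."""
    if x <= 0.0:
        return 0.0
    log_front = a * math.log(x) - x - math.lgamma(a)
    if x < a + 1.0:
        # Power series; converges quickly when x < a + 1.
        term = 1.0 / a
        total = term
        denom = a
        for _ in range(10000):
            denom += 1.0
            term *= x / denom
            total += term
            if abs(term) < abs(total) * 1e-15:
                break
        return total * math.exp(log_front)
    # Continued fraction for the upper function Q(a, x), modified Lentz method.
    tiny = 1e-300
    b = x + 1.0 - a
    c = 1.0 / tiny
    d = 1.0 / b
    h = d
    for i in range(1, 10000):
        an = -i * (i - a)
        b += 2.0
        d = an * d + b
        if abs(d) < tiny:
            d = tiny
        c = b + an / c
        if abs(c) < tiny:
            c = tiny
        d = 1.0 / d
        delta = d * c
        h *= delta
        if abs(delta - 1.0) < 1e-14:
            break
    return 1.0 - h * math.exp(log_front)


def chi2_upper_tail(x, n):
    """P{chi-square(n) > x}."""
    return 1.0 - _gamma_p(0.5 * n, 0.5 * x)


def chi2_upper_point(eps, n):
    """Value exceeded with probability eps by chi-square with n d.f."""
    lo, hi = 0.0, max(1.0, float(n))
    while chi2_upper_tail(hi, n) > eps:
        hi *= 2.0
    for _ in range(200):  # bisection; the upper tail decreases as x grows
        mid = 0.5 * (lo + hi)
        if chi2_upper_tail(mid, n) > eps:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


def rho(alpha, beta, n1):
    """Formula (5): ratio of the alpha point to the (1 - beta) point."""
    return chi2_upper_point(alpha, n1) / chi2_upper_point(1.0 - beta, n1)


if __name__ == "__main__":
    print(f"rho(0.05, 0.50, 9) = {rho(0.05, 0.50, 9):.4f}")
    print(f"rho(0.01, 0.50, 9) = {rho(0.01, 0.50, 9):.4f}")
```

The two printed values may be compared with the entries 2.0280 and 2.5970 quoted from Tables 8.1 and 8.2.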

Example III. If a decision that $\sigma_1^2/\sigma_0^2 \le 1$ is a serious error
from the practical viewpoint when $\sigma_1 = 1.5\sigma_0$ ($\rho = 2.25$), and it is
desired to keep the risk of such an error below 0.05 ($\beta = 0.05$)
when the test is conducted at the 0.05 level of significance ($\alpha = 0.05$),
how many degrees of freedom will be needed for $s_1^2$?

On looking in the $\beta = 0.05$ column of Table 8.1, we find that
$n_1 = 30$ is not quite sufficient, since $\rho(0.05,0.05,30) = 2.3670$; but
$n_1 = 40$ is ample, since $\rho(0.05,0.05,40) = 2.1034$. By interpolation,
using $1/\sqrt{n_1}$ as argument, it is found that 34 (33.7) degrees of
freedom are needed (linear interpolation gives 34.4, which would
be taken as 35).

Fig. 8.2. Operating-characteristic curves for experiments to determine whether a new
product (process) is more variable than a standard product (process) of known variability,
0.01 level of significance.

If $\sigma_0 = 1.79$ cm. ($\sigma_0^2 = 3.2$ cm.²) as in Example I, so that
$\sigma_1 = 1.5\sigma_0 = 2.69$ cm., and it is felt that this is too large an excess to
have undetected 1 time in 20, but failure to detect that $\sigma_1 > \sigma_0$
when $\sigma_1 = 2.00$ cm., say, could be tolerated 1 time in 20, then
$\rho = 1.25$, and it is clear from an examination of Table 8.1 that a
very large number of degrees of freedom, over 400, are needed,
an astonishingly large number to persons who have never investigated
this matter before.*
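
The degrees of freedom required in Example III can be approximated in two ways, sketched below in Python: by interpolating between the two tabulated entries quoted in the example, and by the large-sample formula given in the footnote to the example. The function names are hypothetical, only two tabulated points are used, and the normal deviates come from `statistics.NormalDist` rather than from Table 1.1.

```python
import math
from statistics import NormalDist

# Entries quoted in Example III from the beta = 0.05 column of Table 8.1.
RHO_30 = 2.3670  # rho(0.05, 0.05, 30)
RHO_40 = 2.1034  # rho(0.05, 0.05, 40)


def df_linear(rho, n_a, rho_a, n_b, rho_b):
    """Two-point interpolation with n_1 itself as the argument."""
    frac = (rho_a - rho) / (rho_a - rho_b)
    return n_a + frac * (n_b - n_a)


def df_inverse_sqrt(rho, n_a, rho_a, n_b, rho_b):
    """Two-point interpolation with 1/sqrt(n_1) as the argument."""
    u_a, u_b = 1.0 / math.sqrt(n_a), 1.0 / math.sqrt(n_b)
    frac = (rho_a - rho) / (rho_a - rho_b)
    u = u_a + frac * (u_b - u_a)
    return 1.0 / (u * u)


def df_large_sample(rho, alpha, beta):
    """Large-sample formula of the footnote, written in terms of sqrt(rho)."""
    k_alpha = NormalDist().inv_cdf(1.0 - alpha)
    k_beta = NormalDist().inv_cdf(1.0 - beta)
    root = math.sqrt(rho)
    return 0.5 * ((k_alpha + k_beta * root) / (root - 1.0)) ** 2


if __name__ == "__main__":
    print(f"linear in n_1:        {df_linear(2.25, 30, RHO_30, 40, RHO_40):.1f}")
    print(f"linear in 1/sqrt(n_1): {df_inverse_sqrt(2.25, 30, RHO_30, 40, RHO_40):.1f}")
    print(f"large-sample, 2.25:   {df_large_sample(2.25, 0.05, 0.05):.1f}")
    print(f"large-sample, 1.25:   {df_large_sample(1.25, 0.05, 0.05):.1f}")
```

Two-point interpolation in $1/\sqrt{n_1}$ gives a figure close to, but not identical with, the 33.7 quoted in the text, which presumably rests on a finer interpolation; either figure rounds up to 34. For $\rho = 1.25$ the large-sample formula gives a value above 400, in agreement with the remark in the example.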

As is pointed out in the footnote to the tables, $\rho(\alpha,\beta,n_1)$ is the value
of $\sigma_1^2/\sigma_0^2$ for which the probability of the decision $\sigma_1^2/\sigma_0^2 \le 1$
equals $\beta$ when the test is conducted at the $\alpha$ level of significance with
$n_1$ degrees of freedom. When $\rho(\alpha,\beta,n_1) < 1$, $\beta$ can no longer be
interpreted as the probability of an error of the second kind, since the
decision $\sigma_1^2/\sigma_0^2 \le 1$ is correct under these circumstances.† The values of
$\rho(\alpha,\beta,n_1) < 1$ have been included in the tables since they are useful
for drawing the operating-characteristic curves.

Let $L_\rho = L(\alpha,n_1,\rho)$ denote the probability that the decision
$\sigma_1^2/\sigma_0^2 \le 1$ will be made when $\sigma_1^2/\sigma_0^2 = \rho$ and the test defined by
formulas (1) and (2) is conducted at the $\alpha$ level of significance with $n_1$
degrees of freedom for $s_1^2$. Then $L_\rho$ plotted as a function of $\rho$, for fixed
values of $\alpha$ and $n_1$, is termed the operating-characteristic curve of the
test so defined. Since the probability that the decision $\sigma_1^2/\sigma_0^2 > 1$ will
be reached is $1 - L_\rho$, in view of the exhaustive character of inequalities
(1) and (2), it is clear that the curve $L_\rho$ completely specifies the
operating characteristics of the test, that is, the expected performance of
the test in repeated applications. Thus, it is not necessary to plot the
complementary curve $1 - L_\rho$, except perhaps for psychological reasons. A
probability of 0.90 that a correct decision will be made may evoke less
concern about the stringency of the test than a statement that the
probability of an incorrect decision is 0.10.

Figure 8.1 shows the operating-characteristic curves corresponding
to 5, 10, 15, 20, 30, 40, 60, and 120 degrees of freedom when
$\alpha = 0.05$; Fig. 8.2 shows the corresponding curves when $\alpha = 0.01$.

* The following formula may be used when $n_1$ is large, say more than 30:

$$n_1 \approx \frac{1}{2}\left(\frac{K_\alpha\sigma_0 + K_\beta\sigma_1}{\sigma_1 - \sigma_0}\right)^2 = \frac{1}{2}\left(\frac{K_\alpha + K_\beta\sqrt{\rho}}{\sqrt{\rho} - 1}\right)^2$$

where $K_\epsilon$ denotes a normal deviate exceeded with probability $\epsilon$ and may be
found from Table 1.1 of Chap. 1.

† From (5) it is clear that $\rho(\alpha,\beta,n_1) = 1$ for $\beta = 1 - \alpha$, and is less than 1
for $\beta > 1 - \alpha$, since $\chi^2_\epsilon$ increases as $\epsilon$ decreases, whatever the value of $n$.

3. EXPERIMENTS TO DETERMINE WHETHER A NEW PRODUCT
IS LESS VARIABLE THAN A STANDARD PRODUCT

3.1. Problem

There are not many situations in which a new product is preferred to
a standard product only if its variability exceeds that of the standard.
Consequently, a test of the hypothesis $\sigma_1^2/\sigma_0^2 \ge 1$ relative to the
alternative $\sigma_1^2/\sigma_0^2 < 1$ is rarely employed. Nevertheless, there are
situations in which too small a variability is disadvantageous. For
instance, an average gunner is more likely to bring down a bird with a
shotgun than with a rifle. It would not be advisable to reduce the
dispersion of the shot to nearly zero.

In such cases the standard procedure is modified to

(a) Conclude $\sigma_1^2/\sigma_0^2 \ge 1$ whenever

$$\frac{s_1^2}{\sigma_0^2} \ge \frac{\chi^2_{1-\alpha}(n_1)}{n_1} \tag{6}$$

(b) Conclude $\sigma_1^2/\sigma_0^2 < 1$ whenever

$$\frac{s_1^2}{\sigma_0^2} < \frac{\chi^2_{1-\alpha}(n_1)}{n_1} \tag{7}$$

In connection with this rule, the decision $\sigma_1^2/\sigma_0^2 < 1$ when in fact
$\sigma_1^2/\sigma_0^2 \ge 1$ is an error of the first kind, whence the maximum value of
the probability of an error of the first kind is $\alpha$, which is the
probability of the decision $\sigma_1^2/\sigma_0^2 < 1$ when $\sigma_1^2/\sigma_0^2 = 1$. Accordingly,
the statistical test defined by (6) and (7) is said to be conducted at the
$\alpha$ level of significance, although $\chi^2_{1-\alpha}(n_1)$ is the $100(1 - \alpha)$
percentage point of $\chi^2$ for $n_1$ degrees of freedom.

The value of $\rho' = \sigma_1^2/\sigma_0^2$ for which the probability of a decision
$\sigma_1^2/\sigma_0^2 \ge 1$ equals $\beta$ when the test specified by inequalities (6) and
(7) is employed is readily shown to be

$$\rho'(\alpha,\beta,n_1) = \frac{\chi^2_{1-\alpha}(n_1)}{\chi^2_\beta(n_1)} = \frac{1}{\rho(\beta,\alpha,n_1)} \tag{8}$$

Consequently, values of $\rho'$ are found by interchanging $\alpha$ and $\beta$ and taking
the reciprocal of the corresponding $\rho$. Thus

$$\rho'(0.05,0.05,n_1) = \frac{1}{\rho(0.05,0.05,n_1)} \tag{9}$$

and

$$\rho'(0.01,0.05,n_1) = \frac{1}{\rho(0.05,0.01,n_1)} \tag{10}$$

can be obtained from Table 8.1 by taking reciprocals of the relevant
entries; and values of

$$\rho'(0.01,0.01,n_1) = \frac{1}{\rho(0.01,0.01,n_1)} \tag{11}$$

and

$$\rho'(0.05,0.01,n_1) = \frac{1}{\rho(0.01,0.05,n_1)} \tag{12}$$

can be obtained in similar manner from Table 8.2.

3.2. Illustration

Example IV. The scores of students taking a standard examination
have a standard deviation of 20 points ($\sigma_0 = 20$). An untried
examination will be acceptable if with it students' scores have a
standard deviation not less than 20 points ($\sigma_1^2/\sigma_0^2 \ge 1$), and the
authorities are willing to accept a risk of 0.05 ($\alpha = 0.05$) of
erroneously rejecting an examination for which $\sigma_1 = \sigma_0 = 20$. On
the other hand, acceptance of an examination for which $\sigma_1 = 10$
($\rho' = 1/4$) would be a serious error, and it is desired to hold the risk
of such an occurrence below 0.01. How many students will be
needed in a preliminary tryout of the examination?

For $\alpha = 0.05$ and $\beta = 0.01$, we seek the value of $n_1$ for which
$\rho'(0.05,0.01,n_1)$ is 0.25, that is, the value of $n_1$ for which
$\rho(0.01,0.05,n_1)$ is 4. In Table 8.2 we see that

$$\rho(0.01,0.05,16) = 4.0193 \tag{13}$$

$$\rho(0.01,0.05,17) = 3.8526 \tag{14}$$

Accordingly, $n_1 \ge 17$ will be required, which means that the smallest
number of students ($n_1 + 1$) that will suffice is 18.
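
The search carried out in Example IV can be written out as a short Python sketch. It is illustrative only: the function names are hypothetical, and the small dictionary holds just the two entries of Table 8.2 quoted in the example, not the table itself.

```python
# Entries quoted in Example IV from Table 8.2: n_1 -> rho(0.01, 0.05, n_1).
TABLE_8_2_EXCERPT = {16: 4.0193, 17: 3.8526}


def rho_prime(rho_swapped):
    """rho'(alpha, beta, n_1) as the reciprocal of rho(beta, alpha, n_1), per (8)."""
    return 1.0 / rho_swapped


def smallest_df(target_ratio, table):
    """Smallest tabulated n_1 for which rho' is at least the target ratio.

    Returns None when no tabulated entry is sufficient.
    """
    for n1 in sorted(table):
        if rho_prime(table[n1]) >= target_ratio:
            return n1
    return None


if __name__ == "__main__":
    target = (10.0 / 20.0) ** 2  # sigma_1 = 10 against sigma_0 = 20
    n1 = smallest_df(target, TABLE_8_2_EXCERPT)
    print(f"degrees of freedom needed: {n1}; students needed: {n1 + 1}")
```

The condition $\rho' \ge 0.25$ is the same as $\rho(0.01,0.05,n_1) \le 4$, which is the form in which the example states it.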

4. EXPERIMENTS TO DETERMINE WHETHER A PARTICULAR ONE 
OF TWO NEW PRODUCTS IS MORE VARIABLE THAN THE OTHER 

4.1. Problem 

When the variance of the measurable characteristic is unknown for
both products, as it generally is when both are new, the rules of procedure
given in Secs. 2 and 3 cannot be applied. Let $\sigma_1^2$ denote the true variance
of the measurable characteristic in the case of product I, let $\sigma_2^2$ denote
the variance in the case of product II, and let $s_1^2$ and $s_2^2$ denote estimates
of $\sigma_1^2$ and $\sigma_2^2$ computed from measurements on I and II, respectively. A
choice between the conclusion $\sigma_1^2/\sigma_2^2 \le 1$ and the conclusion
$\sigma_1^2/\sigma_2^2 > 1$ can then be based on the observed values of $s_1^2$ and
$s_2^2$, for example, on the observed value of their ratio $s_1^2/s_2^2$. That is,
it may be agreed in advance that either the conclusion $\sigma_1^2/\sigma_2^2 > 1$ or
the conclusion $\sigma_1^2/\sigma_2^2 \le 1$ will be accepted according as $s_1^2/s_2^2$
exceeds or does not exceed some number $C$ chosen in advance.

In selecting $C$ it is necessary to distinguish between the following
two situations: (1) For various reasons (for example, cost, convenience,
and so forth) product I is preferred a priori and should be chosen unless
$\sigma_1^2$ is substantially larger than $\sigma_2^2$. (2) Product II is to be chosen if
$\sigma_1^2 > \sigma_2^2$, but it is immaterial which is chosen when $\sigma_1^2 = \sigma_2^2$.

4.2. Standard Procedure When Product I Is to Be Chosen Unless $\sigma_1^2$ Is
Larger Than $\sigma_2^2$

4.2.1. Procedure. The standard procedure in case (1) is to compute
from the data estimates $s_1^2$ and $s_2^2$ such that in repeated sampling
$n_1 s_1^2/\sigma_1^2$ and $n_2 s_2^2/\sigma_2^2$ will be distributed independently as $\chi^2$ for
$n_1$ and $n_2$ degrees of freedom, respectively, and to base the choice on the
following rule:

(a) Accept product I whenever

$$\frac{s_1^2}{s_2^2} \le F_\alpha(n_1,n_2) \tag{15}$$

(b) Accept product II whenever

$$\frac{s_1^2}{s_2^2} > F_\alpha(n_1,n_2) \tag{16}$$

where $F_\alpha(n_1,n_2)$ is the $\alpha$ level of significance of the variance ratio
distribution

$$\alpha = P\{F(n_1,n_2) > F_\alpha(n_1,n_2)\} \tag{17}$$

Since, under the assumptions regarding the distribution of $s_1^2$ and $s_2^2$,
the ratio $(s_1^2/\sigma_1^2)/(s_2^2/\sigma_2^2)$ is distributed as $F$ for $n_1$ and $n_2$ degrees
of freedom whatever the values of $\sigma_1^2$ and $\sigma_2^2$, it is clear from (17) that
the probability of (16) occurring when $\sigma_1^2 = \sigma_2^2$ is exactly $\alpha$.

Accordingly, if choosing product II when its variability is not less than
that of product I (that is, when $\sigma_1^2/\sigma_2^2 \le 1$) is termed an error of the
first kind, clearly $\alpha$ is the maximum value of the probability of an error of
the first kind when the standard procedure is followed and the
distributions of $s_1^2$ and $s_2^2$ are as assumed. The test is thus said to be
conducted at the $\alpha$ level of significance.

Let the true value of the ratio $\sigma_1^2/\sigma_2^2$ be denoted by

$$\phi = \frac{\sigma_1^2}{\sigma_2^2} \tag{18}$$

Then it is of interest to determine the value of $\phi$ for which the probability
is $\beta$ that product I will be chosen by application of (15) and (16). The
answer is readily found from the relation*

$$\phi(\alpha,\beta,n_1,n_2) = \frac{F_\alpha(n_1,n_2)}{F_{1-\beta}(n_1,n_2)} = F_\alpha(n_1,n_2)\cdot F_\beta(n_2,n_1) \tag{19}$$

* Shortly after this formula was derived, a British document was received in which
essentially the same formula is given and attributed to D. V. Lindley; see Baines,
Economical Design.

Extensive tables of $\phi(\alpha,\beta,n_1,n_2)$ corresponding to $\alpha = 0.05$ and 0.01 and
to a range of values of $\beta$, $n_1$, and $n_2$ are presented as Tables 8.3 and 8.4,
pp. 284-295 and 298-309. The use of formula (19) in the computation
of these tables is explained in Sec. 6, which also includes a discussion of
methods of interpolation. Figures 8.3 and 8.4 show the operating-characteristic
curves plotted from the tables, for $n_1 = n_2 = 5$, 10, 15, 20, 30, 40,
60, 120.

4.2.2. Two Special Cases. Before illustrating the use and interpretation
of Tables 8.3 and 8.4, we may notice two important special cases
involving $n_1 = n_2$:

(a) When $n_1 = n_2 = n$ and $\beta = 0.50$

$$F_{0.50}(n,n) = 1 \tag{20}$$

for all values of $n$, whence

$$\phi(\alpha,0.50,n,n) = F_\alpha(n,n) \tag{21}$$

In other words, when the degrees of freedom for $s_1^2$ and $s_2^2$ are equal, the
probability of an error of the second kind is 1/2 when $\sigma_1^2/\sigma_2^2 = F_\alpha(n,n)$,
the critical ratio employed in the test. Thus $F_\alpha(n,n)$ may be termed the
indifference value of $\sigma_1^2/\sigma_2^2$, that is, the value of $\sigma_1^2/\sigma_2^2$ for which
the test is equally likely to select either product.

(b) When $n_1 = n_2 = n$ and $\beta = \alpha$

$$\phi(\alpha,\alpha,n,n) = F_\alpha^2(n,n) \tag{22}$$

In other words, when the degrees of freedom for $s_1^2$ and $s_2^2$ are equal, the
probability of an error of the second kind is $\alpha$ when $\sigma_1/\sigma_2$, the ratio of the
true standard deviations, equals $F_\alpha(n,n)$, the critical ratio employed in
the test. That is, the risk of an error of the second kind when
$\sigma_1/\sigma_2 = F_\alpha(n,n)$ is the same as the risk of an error of the first kind
when $\sigma_1/\sigma_2 = 1$.

4.2.3. Illustrations 

Example V. A choice between two products is to be made, the
relative variability of the two products being relevant but of secondary
importance in the sense that it is immaterial which is chosen if
$\sigma_1^2/\sigma_2^2 = 2$, product I being preferred if $\sigma_1^2/\sigma_2^2 < 2$, product II if
$\sigma_1^2/\sigma_2^2 > 2$. If the choice is to be based on (15) and (16), using the
0.05 level of significance ($\alpha = 0.05$), and if $s_1^2$ and $s_2^2$ are to be based on
an equal number ($n_1 = n_2 = n$) of degrees of freedom, how many
degrees of freedom will be needed for each, and for what value of
$\sigma_1^2/\sigma_2^2$ is the probability 0.95 that product II will be chosen
($\beta = 0.05$)?

From the first special case of Sec. 4.2.2, we have

$$\phi(0.05,0.50,n,n) = F_{0.05}(n,n),$$

and from tables of the $F$ distribution we find this to be 1.9838 when
$n = 24$. Thus, if 24 degrees of freedom are used for each product,
the probability that product II will be chosen is 0.05 if $\sigma_1^2/\sigma_2^2 = 1$, 0.50
if $\sigma_1^2/\sigma_2^2 = 2$, and (according to case (b) of Sec. 4.2.2) 0.95 if
$\sigma_1^2/\sigma_2^2 = 4$.

Example VI. If only 30 degrees of freedom are available for $s_1^2$
and if a test is conducted at the 0.05 level of significance, how many
degrees of freedom will be needed for $s_2^2$ in order to ensure that the
probability is only 0.05 of product I being chosen when $\sigma_1/\sigma_2 = 1.5$?
The desired stringency cannot be achieved, since Table 8.3 shows
$\phi(0.05,0.05,30,\infty) = 2.367$, which exceeds $\sigma_1^2/\sigma_2^2 = 2.25$.
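
The figures in Examples V and VI follow from the two special cases (21) and (22) by simple arithmetic, as the Python sketch below illustrates. The function names are hypothetical, and the values 1.9838 and 2.367 are those quoted in the examples from the $F$ tables and Table 8.3; nothing here computes percentage points of $F$.

```python
F_05_24_24 = 1.9838      # F_0.05(24, 24), as quoted in Example V
PHI_LIMIT_30_INF = 2.367  # phi(0.05, 0.05, 30, infinity), as quoted in Example VI


def indifference_ratio(f_alpha):
    """Special case (21): variance ratio at which either product is equally likely."""
    return f_alpha


def ratio_for_beta_equal_alpha(f_alpha):
    """Special case (22): variance ratio at which the second-kind risk equals alpha."""
    return f_alpha ** 2


def stringency_attainable(target_variance_ratio, phi_limit):
    """True when the limiting phi does not exceed the variance ratio to be detected."""
    return phi_limit <= target_variance_ratio


if __name__ == "__main__":
    print(f"indifference value of the variance ratio: {indifference_ratio(F_05_24_24):.4f}")
    print(f"variance ratio detected 95 times in 100:  {ratio_for_beta_equal_alpha(F_05_24_24):.4f}")
    print(f"Example VI attainable: {stringency_attainable(1.5 ** 2, PHI_LIMIT_30_INF)}")
```

The first two figures are close to the round values 2 and 4 used in Example V; the last line reproduces the negative conclusion of Example VI.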

4.3. Procedure When It Is Immaterial Which Product Is Chosen If
$\sigma_1^2 = \sigma_2^2$

This situation arises when variability is the primary basis for choosing
between two products. The following rule is appropriate to this
situation:

(a) Accept product I whenever

$$\frac{s_1^2}{s_2^2} \le F_{0.50}(n_1,n_2) \tag{23}$$

(b) Accept product II whenever

$$\frac{s_1^2}{s_2^2} > F_{0.50}(n_1,n_2) \tag{24}$$

where $F_{0.50}(n_1,n_2)$ is the median of the $F$ distribution for $n_1$ and $n_2$
degrees of freedom. Note that $F_{0.50}(n_1,n_2) \ne 1$ unless $n_1 = n_2$.

By setting $\alpha = 0.50$ in (19), it is readily seen that when the choice
is based on (23) and (24) the probability is $\beta$ that product I will be chosen
if

$$\frac{\sigma_1^2}{\sigma_2^2} = \phi(0.50,\beta,n_1,n_2) = F_{0.50}(n_1,n_2)\cdot F_\beta(n_2,n_1) \tag{25}$$

Conversely, the probability is $\beta$ that product II will be chosen when

$$\frac{\sigma_1^2}{\sigma_2^2} = \phi(0.50,1-\beta,n_1,n_2) = F_{0.50}(n_1,n_2)\cdot F_{1-\beta}(n_2,n_1) \tag{26}$$

Since, under the circumstances assumed, the assignment of the
designations I and II to the products is arbitrary, it is natural to require
that (26) be the reciprocal of (25). For this to be the case, it is necessary
that

$$F_{0.50}^2(n_1,n_2) = F_\beta(n_1,n_2)\cdot F_{1-\beta}(n_1,n_2) \tag{27}$$

It is clear that when $n_1 = n_2$, equation (27) holds for every value of $\beta$;
it is believed that the condition $n_1 = n_2$ is necessary for the equality
to hold. Consequently, if symmetry of action is considered desirable in
the situation considered in this section, the test should be carried out
with the same number of degrees of freedom for $s_1^2$ and $s_2^2$.

Values of

$$\phi(0.50,\beta,n,n) = F_\beta(n,n) = \phi(\beta,0.50,n,n) \tag{28}$$

can be obtained from Tables 8.3 and 8.4 by means of (27). For example,
$\phi(0.50,0.05,30,30) = \phi(0.05,0.50,30,30) = 1.841$, from Table 8.3.

Fig. 8.3. Operating-characteristic curves for experiments to determine whether a
particular one of two new products (processes) is more variable than the other, 0.05
level of significance.

Table 8.3 (portion). Values of $\phi(\alpha,\beta,n_1,n_2)$ from formula (19); the surviving
entries agree with $\alpha = 0.05$, $\beta = 0.975$. Rows are $n_1$, columns are $n_2$.

n_1 \ n_2       1       2       3       4       5       6       7       8       9      10      12      15      20      24      30      40      60     120       ∞
       60     ...     ...     ...     ...     ...     ...     ...     ...     ...     ...     ...     ...     ...     ...     ...     ...     ...  0.9340  0.9494
       40  0.2497  0.4933  0.6123  0.6797  0.7229  0.7530  0.7752  0.7924  0.8061  0.8174  0.8347  0.8527  0.8717  0.8816  0.8919  0.9027  0.9142  0.9263  0.9397
       30  0.2497  0.4931  0.6119  0.6791  0.7220  0.7518  0.7738  0.7908  0.8043  0.8153  0.8323  0.8499  0.8682  0.8778  0.8877  0.8978  0.9085  0.9198  0.9317
       24  0.2497  0.4931  0.6116  0.6785  0.7211  0.7507  0.7725  0.7892  0.8025  0.8133  0.8300  0.8471  0.8650  0.8742  0.8837  0.8934  0.9035  0.9140  0.9251
       20  (entries illegible)
       15  0.2497  0.4927  0.6106  0.6767  0.7186  0.7475  0.7686  0.7847  0.7975  0.8078  0.8236  0.8398  0.8561  0.8647  0.8733  0.8820  0.8909  0.9000  0.9093
       12  0.2497  0.4925  0.6099  0.6755  0.7169  0.7454  0.7661  0.7830  0.7944  0.8045  0.8198  0.8353  0.8512  0.8592  0.8674  0.8756  0.8839  0.8924  0.9010
       10  0.2497  0.4923  0.6093  0.6744  0.7154  0.7434  0.7638  0.7793  0.7915  0.8013  0.8162  0.8312  0.8465  0.8542  0.8620  0.8698  0.8777  0.8857  0.8938
        9  0.2497  0.4922  0.6089  0.6737  0.7143  0.7421  0.7623  0.7776  0.7896  0.7993  0.8139  0.8286  0.8436  0.8511  0.8587  0.8663  0.8739  0.8817  0.8894
        8  (entries illegible)
        7  0.2497  0.4918  0.6077  0.6716  0.7115  0.7386  0.7582  0.7730  0.7845  0.7938  0.8078  0.8218  0.8359  0.8430  0.8501  0.8572  0.8642  0.8713  0.8785
        6  (entries illegible)
        5  0.2497  0.4910  0.6055  0.6681  0.7067  0.7327  0.7514  0.7655  0.7764  0.7851  0.7982  0.8112  0.8242  0.8307  0.8371  0.8436  0.8500  0.8564  0.8627
        4  0.2496  0.4904  0.6037  0.6651  0.7028  0.7280  0.7461  0.7596  0.7700  0.7784  0.7908  0.8032  0.8155  0.8215  0.8276  0.8336  0.8396  0.8455  0.8514
        3  0.2496  0.4893  0.6009  0.6605  0.6968  0.7209  0.7380  0.7508  0.7606  0.7685  0.7801  0.7916  0.8030  0.8086  0.8141  0.8197  0.8252  0.8306  0.8359
        2  0.2495  0.4872  0.5954  0.6521  0.6861  0.7085  0.7242  0.7359  0.7448  0.7519  0.7624  0.7728  0.7829  0.7879  0.7929  0.7978  0.8026  0.8074  0.8121
        1  0.2492  0.4808  0.5806  0.6309  0.6603  0.6794  0.6926  0.7024  0.7098  0.7157  0.7243  0.7328  0.7411  0.7451  0.7492  0.7531  0.7570  0.7608  0.7646

5. EXPERIMENTS INVOLVING THE ANALYSIS OF VARIANCE

5.1. Systematic and Random Variation

Many experiments are analyzed by a technique (due to R. A. Fisher)
known as the analysis of variance.* Tables 8.3 and 8.4 are useful in
planning a certain class of such experiments, namely, those for detecting
and measuring components of random variation.

Fig. 8.4. Operating-characteristic curves for experiments to determine whether a
particular one of two new products (processes) is more variable than the other, 0.01 level
of significance.

The factors associated with the variation of a product can be divided
into two categories:

(1) There are factors that introduce random variation. Lack of
perfect control at some stage of a production process often acts in this

* Excellent accounts of the analysis of variance, which are especially pertinent to
this chapter, are given by Baines, Economical Design, and Daniels, “Components of
Variance.”

Table 8.4 (portion). Values of $\phi(\alpha,\beta,n_1,n_2)$ from formula (19); the entries
agree with $\alpha = 0.01$, $\beta = 0.005$. Rows are $n_1$, columns are $n_2$.

n_1 \ n_2         1         2         3         4         5         6         7         8         9        10        12        15        20        24        30        40        60       120         ∞
        ∞    50100.     527.2     111.8     50.02     30.22     21.27     16.37     13.33     11.30     9.846     7.926     6.273     4.842     4.197     3.589     3.012     2.453     1.883     1.000
      120    51850.     551.1     117.9     53.16     32.33     22.89     17.71     14.51     12.35     10.81     8.775     7.022     5.507     4.825     4.187     3.587     3.016     2.461     1.896
       60    53630.     576.5     124.4     56.52     34.60     24.64     19.17     15.77     13.49     11.85     9.694     7.833     6.225     5.504     4.830     4.198     3.603     3.037     2.487
       40    55500.     603.4     131.4     60.12     37.03     26.52     20.73     17.14     14.71     12.98     10.69     8.710     7.002     6.236     5.522     4.854     4.228     3.638     3.076
       30    57470.     632.4     138.9     63.98     39.65     28.55     22.42     18.61     16.04     14.20     11.76     9.661     7.844     7.029     6.270     5.562     4.899     4.277     3.691
       24    59550.     662.5     146.8     68.11     42.46     30.73     24.24     20.20     17.47     15.52     12.93     10.69     8.757     7.889     7.081     6.327     5.624     4.965     4.347
       20    61740.     694.8     155.3     72.54     45.49     33.07     26.20     21.92     19.02     16.95     14.19     11.81     9.747     8.822     7.961     7.158     6.409     5.709     5.053
       15    66490.     765.7     174.0     82.39     52.23     38.33     30.61     25.78     22.51     20.16     17.04     14.33     11.99     10.94     9.955     9.040     8.187     7.390     6.646
       12    71770.     846.0     195.5     93.73     60.03     44.43     35.74     30.29     26.59     23.93     20.39     17.31     14.64     13.43     12.31     11.27     10.29     9.380     8.529
       10    77670.     937.0     220.0     106.8     69.07     51.53     41.72     35.56     31.37     28.35     24.32     20.82     17.76     16.39     15.11     13.91     12.79     11.74     10.77
        9    81990.     1005.     238.4     116.6     75.89     56.90     46.26     39.56     35.00     31.72     27.32     23.50     20.16     18.65     17.25     15.94     14.71     13.56     12.49
        8    87860.     1097.     263.8     130.3     85.42     64.42     52.63     45.19     40.12     36.46     31.56     27.29     23.55     21.87     20.29     18.82     17.44     16.15     14.94
        7    96250.     1232.     301.1     150.5     99.56     75.62     62.13     53.61     47.79     43.58     37.93     33.00     28.68     26.73     24.90     23.19     21.58     20.08     18.68
        6   109200.     1445.     360.5     182.9     122.3     93.75     77.57     67.31     60.29     55.20     48.37     42.38     37.12     34.74     32.51     30.41     28.45     26.61     24.88
        5   131300.     1819.     466.8     241.5     163.8     126.9     105.9     92.59     83.42     76.76     67.78     59.89     52.94     49.78     46.81     44.03     41.41     38.95     36.64
        4   176200.     2609.     696.5     369.9     255.8     201.0     169.7     149.6     135.8     125.7     112.1     100.0     89.35     84.49     79.92     75.62     71.56     67.74     64.14
        3   300200.     4938.     1398.     771.2     547.4     438.5     375.5     335.0     306.8     286.2     258.3     233.4     211.2     201.1     191.5     182.5     173.9     165.8     158.2
        2   992400.    19700.     6138.     3586.     2646.     2178.     1903.     1724.     1599.     1507.     1381.     1268.     1167.     1120.     1075.     1033.     992.9     954.9     918.8
        1 65690000.  1970000.   737400.   477000.   374800.   322100.   290400.   269400.   254400.   243300.   227900.   213900.   201100.   195100.   189400.   183900.   178700.   173700.   169000.
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8 

646.3 

28.62 

10.56 

6.471 

4.825 
3.961 
3.434 
3.080 

2.825 
2.634 
2.363 
2.111 
1.871 
1.754 
1.637 

1.519 

1.395 

1 256 
1.000 

S 

646.6 

28.69 

10.61 

6.515 

4.868 

4.004 

3.477 

3.123 

2.870 

2.679 

2.410 

2.161 

1.925 

1.811 

1.609 
1.587 
1.474 
1 355 
1.222 

8 

646.9 
28.76 
10.66 
6.558 
4.910 
4.046 
3.519 
3.166 
2.913 
2.723 
2.456 
2.209 
1.977 
1.865 
1.756 
1.649 
1.542 
1 433 
1.319 

3 

647.2 

28.82 

10.70 

6.602 

4.952 

4.087 

3.561 

3.208 
2.956 
2.767 
2.501 
2 255 
2.026 
1.917 
1.810 

1.705 

1.603 

1.500 

1.396 

3 

647.4 

28.89 

10.75 

6.645 

4.993 

4.128 

3.602 

3.250 

2.998 

2.809 

2.545 

2.300 

2 . U 73 

1.965 

1.861 

1.759 

1.659 

1.560 

1.462 

3 

. s s 8 § S 3 s i S § 3 S §11g S § 
-~ 

3 

o s s g 13 § 31 § 1 Si i i § g 15 
- ' - - 

»o 

|gO<D‘0'tM«eONWNiNIN(Nrtrtrt— ! 

CM 

649.2 

29.30 

11.04 

6 902 

5.238 
4.369 
3.841 
3.480 

3.238 
3.050 
2.788 
2.549 
2.329 
2.226 
2.127 
2.031 
1.939 
1.851 

1.766 

© 

649.8 

29.43 

11.14 

6.987 

5.318 

4.447 

3.918 

3.565 

3.314 

3.126 

2.865 

2.626 

2.407 

2.304 

2.200 

2.111 

2.020 

1.933 

1.849 

03 

650.2 

29.53 

11.20 

7.043 

5.371 

4.498 

3.968 

3.615 

3.364 

3.176 

2.915 

2.676 

2.457 

2.355 

2.256 

2.162 

2.072 

1.986 

1.902 

oo 

650.7 
29.64 
11.28 
7.113 
5.437 
4.562 
4.031 
3.677 
3.425 
3.237 
2.976 
2 737 
2.518 
2.416 
2.318 
2.224 
2.135 
2.049 

1.966 

N 

651.3 

29.79 

11.39 

7.203 

5.522 

4.643 

4.111 

3.756 

3.503 

3.315 

3.053 

2.814 

2.595 
2.493 
2.395 
2.302 
2.212 
2 127 

2.044 

CO 

652.2 

29.99 

11.52 

7.323 

5.633 

4.751 

4.216 

3.859 

3.606 

3.416 

3.154 

2.914 

2.695 

2.593 

2.495 

2.401 

2.312 

2.226 

2.144 

«9 

653.5 
30.28 
11.72 
7.490 
5.788 
4.899 
4.360 
4.001 
3.746 
3.555 
3.291 
3.050 
2.829 
2.727 
2 628 
2.535 
2.445 
2.359 
2.277 


655.5 

30.71 

12.01 

7.740 

6.019 

5.119 

4.573 

4.210 

3.951 

3.758 

3.491 

3.247 

3.024 

2.920 

2.821 

2.726 

2.635 

2.548 

2.465 

C4 

658.9 

31.45 

12.51 

8.157 

6.400 

5.481 

4.922 

4.550 

4.286 

4.088 

3.814 

3.563 

3.335 

3.228 

3.126 

3.029 

2.936 

2.847 

2.762 

CM 

666.6 
33.00 
13.52 
9.000 
7.164 
6.200 
5.612 
5.220 
4.941 
4.732 
4.442 
4.176 
3.933 
3.820 
3.711 
3.607 
3.506 
3 413 
3.322 

- 

695 2 
38.31 
16.86 
11.73 
9.606 
8.477 
7.784 
7.319 
6.984 
6.734 
6.385 
6.063 
5.768 
5.629 
5.496 
5.368 
5.245 
5.127 
5.014 

«/ 
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manner, and the material produced exhibits an inherent random vari¬ 
ability. This random variation may have assignable components. For 
instance, the thickness of washers punched from a single sheet may be 
regarded as varying at random about the mean thickness of washers from 
that sheet, because of the inherent nonuniformity of the sheet; and 
washers produced during a given shift may have still greater variation 
about the mean thickness for the shift, because of an additional compo¬ 
nent of variation assignable to the random variation of the mean thick¬ 
nesses of the respective sheets about the over-all mean thickness of the 
sheets on that shift, and so forth. 

(2) There are factors that introduce systematic variation. For 
instance, in a production unit composed of several similar machines 
producing the same product simultaneously, there are generally sys¬ 
tematic differences among the machines, which superpose a pattern of 
variation on the output of that production unit. 

In using analysis of variance merely to determine whether the 
over-all variation has a component assignable to a particular factor, the 
computational procedure and the mechanics of the statistical test are 
the same in the case of a factor that may contribute a component of random 
variation as in the case of a factor that may contribute a component of 
systematic variation. On the other hand, in planning such investigations
and in the estimation of the magnitude(s) of the component(s),
the distinction has to be taken into account, since the situations between 
which discrimination is required differ in the two cases, leading to 
differences in the computation of probabilities of errors of the second 
kind. We consider only cases where random variation alone is present. 

The question arises as to how to decide in a given case whether a
component of variation is systematic or random. The answer depends
upon the extent to which the experimental procedure samples the material
at random. If a sampling operation is such that each item of the
population has an equal chance of being included in the sample, and
separate drawings are independent except for any dependence resulting
from the number of items in the population being limited, then the
sampling is random and the sample may be termed a random sample.*
When an experimenter selects two or more treatments for comparison
in some respect, he rarely, if ever, selects them at random from a
population of possible treatments; he selects those he believes most promising.
Accordingly, components of variation assignable to differences between
treatments are systematic. On the other hand, when an experimenter
selects a sample of items from a production lot (or a sample of animals
from a herd) for a study of the effects of the treatments, he can ensure
that they are a random sample from the lot, or herd, by introducing
randomization into the sampling procedure, for example, by drawing
cards from a hat after thorough mixing, or by using a table of random
numbers. He may consider such a sample to be a random sample from
the production process (or the animal species) only by making the
assumption that the lot (herd) itself is a random sample from the process
(species).

* It is important to note that it is the operation of drawing the sample that is
random and not the sample itself, since any sample whatsoever can be obtained by
either a random or a nonrandom operation.

When a particular experiment is being planned, or the results
of a completed experiment are being interpreted, the following two
parallel sets of questions will serve to focus attention on the issue of
systematic versus random variation:

(1) Are the conclusions to be confined to the things actually studied
(the sample of items, or animals), or to the immediate sources of these
things (the actual machines, or lots, or herds from which the things
studied were obtained); or are they to be expanded to apply to the
underlying populations (real or hypothetical population of machines, or the
production process, or the species)?

(2) In complete repetitions of the experiment, would the same 
things be studied again (the same items, animals, or treatments); would 
new samples be drawn from the identical sources (the same machines, the 
same lots, or the same herds); or would new samples be drawn from the 
underlying populations (samples of items from different machines, or 
animals from different herds of the same breed)? 


5.2. A Typical Analysis of Variance


This section gives a typical analysis-of-variance table for cases in which
the random deviations involved are normally distributed. The conditions
are specified under which the test of significance customarily
associated with this table can be regarded as a test of the hypothesis
that two variances $\sigma_1^2$ and $\sigma_2^2$ are equal, relative to the alternative that
$\sigma_1^2 = \sigma_2^2 + K\sigma_b^2$, where $K$ is a known constant and $\sigma_b^2$ is an unknown
variance. Under these conditions the results given in Sec. 4.1 can be
used in the planning and interpretation of experiments leading to
analysis-of-variance tests of significance.

Let $x_{ij}$ denote the $j$th observation in the $i$th group ($i = 1, 2, \ldots, k$;
$j = 1, 2, \ldots, n_i$), let $\bar{x}_{i\cdot}$ denote the average of the $n_i$ observations
in the $i$th group ($i = 1, 2, \ldots, k$), and let $\bar{x}_{\cdot\cdot}$ denote the average of
all $N = \sum_{i=1}^{k} n_i$ observations; then the analysis-of-variance table takes
the following form:

Variation                Degrees of   Sum of     Mean square             Expectation of
                         freedom      squares                            mean square

Between group averages   $k - 1$      $S_b$      $S_b/(k-1) = s_b^2$     $\sigma^2 + \bar{n}\sigma_b^2$
Within groups            $N - k$      $S_w$      $S_w/(N-k) = s_w^2$     $\sigma^2$

where

$$S_b = \sum_{i=1}^{k} n_i(\bar{x}_{i\cdot} - \bar{x}_{\cdot\cdot})^2, \qquad S_w = \sum_{i=1}^{k}\sum_{j=1}^{n_i}(x_{ij} - \bar{x}_{i\cdot})^2,$$

and $\bar{n} = \left(N^2 - \sum_{i=1}^{k} n_i^2\right)/[N(k-1)]$; when $n_1 = n_2 = \cdots = n_k = n$, $\bar{n} = n$.
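
The entries of the table above can be computed directly from grouped data. The following is a minimal Python sketch and not part of the original text; the function name `one_way_anova`, the dictionary keys, and the sample measurements are hypothetical and serve only to illustrate the two sums of squares, the two mean squares, and the quantity written $\bar{n}$.

```python
from statistics import mean

def one_way_anova(groups):
    """Return the entries of a one-way analysis-of-variance table.

    `groups` is a list of lists; groups[i][j] plays the role of x_ij.
    """
    k = len(groups)
    sizes = [len(g) for g in groups]
    N = sum(sizes)
    group_means = [mean(g) for g in groups]
    grand_mean = sum(sum(g) for g in groups) / N

    # Between-group and within-group sums of squares
    S_b = sum(n * (m - grand_mean) ** 2 for n, m in zip(sizes, group_means))
    S_w = sum((x - m) ** 2 for g, m in zip(groups, group_means) for x in g)

    # Effective group size n-bar; equals n when every group has n observations
    n_bar = (N ** 2 - sum(n ** 2 for n in sizes)) / (N * (k - 1))

    return {
        "df_between": k - 1,
        "df_within": N - k,
        "S_b": S_b,
        "S_w": S_w,
        "s_b2": S_b / (k - 1),   # mean square between group averages
        "s_w2": S_w / (N - k),   # mean square within groups
        "n_bar": n_bar,
    }

# Hypothetical measurements on three lots (illustrative data only)
lots = [
    [10.1, 9.8, 10.3, 10.0],
    [10.6, 10.9, 10.4, 10.7],
    [9.5, 9.9, 9.7, 9.6],
]
table = one_way_anova(lots)

# Ratio of mean squares, to be compared with a percentage point of F
# for (k - 1, N - k) degrees of freedom
ratio = table["s_b2"] / table["s_w2"]
```

With equal group sizes, as in the hypothetical data, `n_bar` reduces to the common group size.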


If the $x_{ij}$ are independently and normally distributed about the
respective true means $m_i$ of the groups with common (unknown) variance
$\sigma^2$, and if the groups are a random selection from a population of groups
such that in repeated selection the $m_i$ would be normally distributed
about an over-all population mean of $m$ with variance $\sigma_b^2$, the $x_{ij}$ will be
normally distributed about $m$ with variance

$$\sigma^2 + \sigma_b^2 \tag{29}$$

likewise,

$$\frac{(k-1)\,s_b^2}{\sigma^2 + \bar{n}\sigma_b^2} \tag{30}$$

will be distributed as $\chi^2$ for $(k - 1)$ degrees of freedom; and finally

$$\frac{(N-k)\,s_w^2}{\sigma^2} \tag{31}$$

will be distributed independently as $\chi^2$ for $(N - k)$ degrees of freedom.

A comparison of (30) and (31) with the assumptions made with
regard to $s_1^2$ and $s_2^2$ in Sec. 4.1 reveals that if we set

$$\sigma_1^2 = \sigma^2 + \bar{n}\sigma_b^2 \quad\text{and}\quad \sigma_2^2 = \sigma^2 \tag{32}$$

$$n_1 = (k - 1) \quad\text{and}\quad n_2 = N - k \tag{33}$$

$s_1^2$ becomes identified with $s_b^2$ and $s_2^2$ with $s_w^2$, and the rule of procedure
becomes

(a) Accept $\sigma_b^2 = 0$ (there exists no between-group component of
random variation) whenever

$$\frac{s_b^2}{s_w^2} \le F_\alpha(k-1,\, N-k) \tag{34}$$

(b) Accept $\sigma_b^2 > 0$ (there exists a between-group component of random
variation) whenever

$$\frac{s_b^2}{s_w^2} > F_\alpha(k-1,\, N-k) \tag{35}$$

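Since $\sigma_b^2$ cannot be negative, the probability of an error of the first
kind (concluding that $\sigma_b^2 > 0$ when actually $\sigma_b^2 = 0$) is exactly $\alpha$, and
the probability of an error of the second kind (concluding that $\sigma_b^2 = 0$
when actually $\sigma_b^2 > 0$) is $\beta$ when

$$1 + \bar{n}\left(\frac{\sigma_b^2}{\sigma^2}\right) = \phi(\alpha, \beta, k-1, N-k) \tag{36}$$

If there are $n$ observations in each group, $\bar{n} = n$ and
$N - k = k(n - 1)$;
so*

$$\frac{\sigma_b^2}{\sigma^2} = \frac{\phi[\alpha, \beta, k-1, k(n-1)] - 1}{n} \tag{37}$$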

5.3. Illustrations 

The following examples illustrate types of questions to which answers 
may be obtained with the aid of (37) : 

Example VII. It is suspected that differences between lots 
contribute to the random variation of a particular characteristic of a 
certain product. To investigate this hypothesis, n items are to be 
selected at random and measured, from each of k lots chosen at 
random from the lots currently accessible. It is agreed that the test 
of significance defined by (34) and (35) is to be conducted at the 0.05 
level of significance, and it is desired to have a probability not 
greater than 0.50 of falsely concluding $\sigma_b^2 = 0$ if $\sigma_b^2/\sigma^2 = 0.10$. What
combination of values of k and n will accomplish this with a minimum 
total number of measurements? 

Baines has prepared the following table for answering such 
questions when $\alpha = 0.05$ and $\beta = 0.50$:



* Baines, Economical Design, tabulates (37) for $\alpha = 0.05$, $\beta = 0.50$, $k$ = 2 to 31
by steps of 1, and $n$ = 1 to 31 by steps of 1, 31 to 45 by steps of 2, 45 to 93 by steps of
4, 101, and 121.

It is seen that the best combination is 19 items from each of 5 lots,
requiring a total of 95 measurements. However, if it is more convenient
to use 4 lots than 5 lots, 24 items from each of 4 lots will be
equally effective and will require only one additional measurement.
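
The two combinations named in Example VII can be checked numerically from (37). The sketch below is an illustration only and is not the procedure used by Baines. It assumes that $\phi$ is the product of two upper percentage points of F, as stated in formula (47) of Sec. 6.2, and it obtains those percentage points from a generic incomplete-beta routine written for this purpose; the helper names `betainc`, `f_cdf`, `f_upper`, `phi`, and `indifference_ratio` are assumptions of the sketch.

```python
from math import exp, lgamma, log

def _betacf(a, b, x, max_iter=500, eps=1e-14, tiny=1e-300):
    """Continued fraction for the incomplete beta function (modified Lentz method)."""
    qab, qap, qam = a + b, a + 1.0, a - 1.0
    c = 1.0
    d = 1.0 - qab * x / qap
    d = 1.0 / (d if abs(d) > tiny else tiny)
    h = d
    for m in range(1, max_iter + 1):
        m2 = 2 * m
        # Even step, then odd step, of the continued-fraction recurrence
        for aa in (m * (b - m) * x / ((qam + m2) * (a + m2)),
                   -(a + m) * (qab + m) * x / ((a + m2) * (qap + m2))):
            d = 1.0 + aa * d
            d = 1.0 / (d if abs(d) > tiny else tiny)
            c = 1.0 + aa / c
            c = c if abs(c) > tiny else tiny
            delta = d * c
            h *= delta
        if abs(delta - 1.0) < eps:
            break
    return h

def betainc(a, b, x):
    """Regularized incomplete beta function I_x(a, b)."""
    if x <= 0.0:
        return 0.0
    if x >= 1.0:
        return 1.0
    front = exp(lgamma(a + b) - lgamma(a) - lgamma(b)
                + a * log(x) + b * log(1.0 - x))
    if x < (a + 1.0) / (a + b + 2.0):
        return front * _betacf(a, b, x) / a
    return 1.0 - front * _betacf(b, a, 1.0 - x) / b

def f_cdf(f, n1, n2):
    """P(F <= f) for the variance ratio with (n1, n2) degrees of freedom."""
    return betainc(n1 / 2.0, n2 / 2.0, n1 * f / (n1 * f + n2))

def f_upper(p, n1, n2):
    """Value exceeded with probability p by F with (n1, n2) degrees of freedom."""
    lo, hi = 0.0, 1.0
    while f_cdf(hi, n1, n2) < 1.0 - p:
        hi *= 2.0
    for _ in range(200):                      # bisection
        mid = 0.5 * (lo + hi)
        if f_cdf(mid, n1, n2) < 1.0 - p:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

def phi(alpha, beta, n1, n2):
    """Product F_alpha(n1, n2) * F_beta(n2, n1), as in formula (47)."""
    return f_upper(alpha, n1, n2) * f_upper(beta, n2, n1)

def indifference_ratio(alpha, beta, k, n):
    """Ratio of between-group to within-group variance given by (37)."""
    return (phi(alpha, beta, k - 1, k * (n - 1)) - 1.0) / n

r5 = indifference_ratio(0.05, 0.50, 5, 19)   # 19 items from each of 5 lots
r4 = indifference_ratio(0.05, 0.50, 4, 24)   # 24 items from each of 4 lots
```

Both `r5` and `r4` come out close to the ratio 0.10 specified in the example. Looping `indifference_ratio` over a grid of `k` and `n` would give a table of the kind described in the footnote, under the same assumptions.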

Example VIII. If it were agreed initially in planning a test of 
the type discussed in Example VII that a total of 96 measurements 
would be employed, then the problem would arise of determining 
the combination of values of k and n, subject to the restriction 
kn = 96, that minimizes the value of $\sigma_b^2/\sigma^2$ for a given $\beta$.

Baines gives the figures tabulated below, which apply when
$\alpha = 0.05$ and $\beta = 0.50$:


It is clear that the combination k = 4, n = 24 provides the most 
stringent test having $\alpha = 0.05$ and kn = 96, in the sense that it
yields the minimum indifference point. 

In order to answer readily questions of the type illustrated by
Examples VII and VIII when $\alpha$ and $\beta$ differ from 0.05 and 0.50, respectively,
tables of (37) should be prepared using $k$ and $n$ as arguments. Indeed, to
facilitate the planning of experiments leading to analysis-of-variance
tests corresponding to two-way classifications (“rows” and “columns”), to
Latin square designs, and so forth, sets of such tables will have to be
prepared for each type of design, since the number of degrees of freedom
corresponding to the estimate of error variance depends upon the
design of the experiment [for a two-way classification, for example,
$n_2 = (r - 1)(c - 1)$], as does also the number of degrees of freedom
corresponding to the estimate of variance $s_1^2$ to be tested. Until such
tables are available, these questions can be answered, with considerable
effort expended on interpolation, by means of Tables 8.3 and 8.4.
Furthermore, Tables 8.3 and 8.4 will often suffice to determine a few points
on the operating-characteristic curve of a test of significance
corresponding to an experimental design already agreed upon.

6. TECHNICAL NOTES 

6.1. Method of Construction and Accuracy of Tables 8.1 and 8.2 

As explained in Sec. 2, the standard procedure for determining
whether a new product or process is more variable than a standard
product or process is to compute the statistic $s_1^2/\sigma_0^2$ and

(a) Accept the conclusion $\sigma_1^2/\sigma_0^2 \le 1$ whenever

$$\frac{s_1^2}{\sigma_0^2} \le \frac{\chi_\alpha^2(n_1)}{n_1} \tag{38}$$

(b) Accept the conclusion $\sigma_1^2/\sigma_0^2 > 1$ whenever

$$\frac{s_1^2}{\sigma_0^2} > \frac{\chi_\alpha^2(n_1)}{n_1} \tag{39}$$

where $\chi_\alpha^2(n_1)$ is the $\alpha$ level of significance of $\chi^2$ for $n_1$ degrees of freedom.
The level of significance $\alpha$ represents the probability of accepting the
conclusion $\sigma_1^2/\sigma_0^2 > 1$ when in fact $\sigma_1^2/\sigma_0^2 = 1$ (error of the first kind).
Using this level of significance, the probability of accepting the
conclusion $\sigma_1^2/\sigma_0^2 \le 1$ when in fact $\sigma_1^2/\sigma_0^2 > 1$ (error of the second kind)
depends on the ratio $\rho = \sigma_1^2/\sigma_0^2$ and is exactly $\beta$ if $\rho$ has the value

$$\rho(\alpha, \beta, n_1) = \frac{\chi_\alpha^2(n_1)}{\chi_{1-\beta}^2(n_1)} \tag{40}$$
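
Formula (40) can be evaluated without tables once percentage points of $\chi^2$ are available. The following Python sketch is an illustration under a simplifying assumption, not the method by which Tables 8.1 and 8.2 were built: it handles even degrees of freedom only, for which the upper tail of $\chi^2$ has a closed form, and finds percentage points by bisection. The function names `chi2_sf_even`, `chi2_upper_even`, and `rho` are hypothetical.

```python
from math import exp, log

def chi2_sf_even(x, n):
    """P(chi-square with n d.f. > x); closed form valid for even n only."""
    if n <= 0 or n % 2:
        raise ValueError("this sketch handles even degrees of freedom only")
    term, total = 1.0, 1.0
    for j in range(1, n // 2):
        term *= (x / 2.0) / j          # (x/2)^j / j!
        total += term
    return exp(-x / 2.0) * total

def chi2_upper_even(p, n):
    """Value exceeded with probability p, found by bisection."""
    lo, hi = 0.0, 1.0
    while chi2_sf_even(hi, n) > p:
        hi *= 2.0
    for _ in range(200):
        mid = 0.5 * (lo + hi)
        if chi2_sf_even(mid, n) > p:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

def rho(alpha, beta, n):
    """Formula (40): chi2_alpha(n) / chi2_{1 - beta}(n)."""
    return chi2_upper_even(alpha, n) / chi2_upper_even(1.0 - beta, n)

rho_2 = rho(0.05, 0.50, 2)     # for n = 2 this equals log(0.05) / log(0.50)
rho_10 = rho(0.05, 0.05, 10)

# Check of the defining property: the probability of the conclusion
# "not more variable" at the variance ratio rho_10 should be beta = 0.05
beta_check = 1.0 - chi2_sf_even(chi2_upper_even(0.05, 10) / rho_10, 10)
```

For odd degrees of freedom a general incomplete-gamma routine would be needed in place of `chi2_sf_even`.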

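The proof is elementary. Since the conclusion $\sigma_1^2/\sigma_0^2 \le 1$ will be reached
whenever (38) occurs, the probability of this conclusion, given $\sigma_1^2 = \rho\sigma_0^2$,
is

$$P\left\{\frac{s_1^2}{\sigma_0^2} \le \frac{\chi_\alpha^2(n_1)}{n_1} \,\Big|\, \sigma_1^2 = \rho\sigma_0^2\right\} = P\left\{\frac{n_1 s_1^2}{\sigma_1^2} \le \frac{\sigma_0^2}{\sigma_1^2}\,\chi_\alpha^2(n_1)\right\} = P\left\{\chi^2(n_1) \le \frac{\chi_\alpha^2(n_1)}{\rho}\right\} = 1 - P\left\{\chi^2(n_1) > \frac{\chi_\alpha^2(n_1)}{\rho}\right\} \tag{41}$$

From (40),

$$\frac{\chi_\alpha^2(n_1)}{\rho} = \chi_{1-\beta}^2(n_1) \tag{42}$$

so the last form of (41) becomes

$$1 - (1 - \beta) = \beta \tag{43}$$

which completes the proof.*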

* Values of $\chi_\alpha^2(n_1)$ corresponding to $\alpha$ = 0.01, 0.02, 0.05, 0.10, 0.20, 0.30, 0.50,
0.70, 0.80, 0.90, 0.95, 0.98, and 0.99 are given to 3 decimals for $n_1$ = 1 to 30 by steps
of 1 in Table III of Fisher, Statistical Methods and in Table IV of Fisher and Yates,
Statistical Tables, the latter of which also contains the values for $\alpha$ = 0.001. Values of
$\chi_\alpha^2(n_1)$ corresponding to $\alpha$ = 0.005, 0.010, 0.025, 0.050, 0.100, 0.250, 0.500, 0.750,
0.900, 0.950, 0.975, 0.990, and 0.995 are given to 6 significant figures for $n_1$ = 1 to
30 by steps of 1 and from 30 to 100 by steps of 10 by Thompson, “$\chi^2$ distribution.”

Thompson’s table of $\chi_\alpha^2(n_1)$ was used in constructing Tables 8.1
and 8.2. Direct substitution of these values in formula (40) yields
values of $\rho(\alpha,\beta,n)$ corresponding to $\alpha$ = 0.05 and 0.01 and to $\beta$ = 0.005,
0.01, 0.025, 0.05, 0.10, 0.25, 0.50, 0.750, 0.900, 0.950, 0.975, 0.990, and
0.995 for $n$ = 1 to 30 by steps of 1 and 30 to 100 by steps of 10. The
values of $\rho(0.05,\beta,n)$, rounded to 5 figures or 4 decimals, are given in
Table 8.1. The corresponding values of $\rho(0.01,\beta,n)$ are given in Table
8.2. The values of $\rho(\alpha,\beta,n)$ for $n$ = 120 were computed from values of
$\chi_\alpha^2(120)$ obtained from the percentage points of the F distribution tabled
by Merrington and Thompson, by means of the relation

$$\chi_\alpha^2(n) = nF_\alpha(n, \infty) \tag{44}$$

Two-point interpolation in Tables 8.1 and 8.2 in terms of the argument
$1/\sqrt{n}$ will yield about 2-decimal accuracy in most parts of these tables.*

* Interpolation with respect to $\alpha$ and $\beta$ should be carried out in terms of $K_\alpha$ and
$K_\beta$, respectively, where $K_\epsilon$ is the unit normal deviate exceeded with probability $\epsilon$
(tabled in Chap. 1, Table 1.1). For an explanation and examples of this type of
interpolation, see Simaika, Interpolation.

6.2. Method of Construction and Accuracy of Tables 8.3 and 8.4

The procedure described in Sec. 4.1 for determining whether a
particular one of two new products or processes is more variable than the
other is as follows: Compute the statistic $s_1^2/s_2^2$, where $s_1^2$ and $s_2^2$ are
sample estimates of the two population variances such that in repeated
sampling $n_1 s_1^2/\sigma_1^2$ and $n_2 s_2^2/\sigma_2^2$ will be distributed independently as $\chi^2$ for
$n_1$ and $n_2$ degrees of freedom, respectively. Base the choice of product
or process on the rule

(a) Accept product I whenever

$$\frac{s_1^2}{s_2^2} \le F_\alpha(n_1, n_2) \tag{45}$$

(b) Accept product II whenever

$$\frac{s_1^2}{s_2^2} > F_\alpha(n_1, n_2) \tag{46}$$

where $F_\alpha(n_1, n_2)$ is the $\alpha$ level of significance of the variance ratio (F)
distribution. This rule is so formulated that the probability of accepting
product II when in fact $\sigma_1^2/\sigma_2^2 = 1$ (error of the first kind) equals the level
of significance $\alpha$. The probability of accepting product I when in fact
$\sigma_1^2 > \sigma_2^2$ (error of the second kind) depends on the ratio $\phi = \sigma_1^2/\sigma_2^2$ and
is exactly $\beta$ when

$$\phi(\alpha, \beta, n_1, n_2) = \frac{F_\alpha(n_1, n_2)}{F_{1-\beta}(n_1, n_2)} = F_\alpha(n_1, n_2) \cdot F_\beta(n_2, n_1) \tag{47}$$

The proof is similar to the proof of (40). Product I will be chosen
whenever (45) occurs, and the probability of this when $\sigma_1^2/\sigma_2^2 = \phi$ is

$$P\left\{\frac{s_1^2}{s_2^2} \le F_\alpha(n_1, n_2) \,\Big|\, \frac{\sigma_1^2}{\sigma_2^2} = \phi\right\} = P\left\{F \le \frac{F_\alpha(n_1, n_2)}{\phi}\right\} = 1 - P\left\{F > \frac{F_\alpha(n_1, n_2)}{\phi}\right\} \tag{48}$$

where $F$ denotes the variance ratio for $n_1$ and $n_2$ degrees of freedom; this equals

$$1 - (1 - \beta) = \beta \quad\text{when}\quad \frac{F_\alpha(n_1, n_2)}{\phi} = F_{1-\beta}(n_1, n_2) \tag{49}$$
Tables 8.3 and 8.4 at the end of this chapter were computed from
the Merrington-Thompson table of F by direct substitution in formula
(47). These tables give the values of $\phi(\alpha,\beta,n_1,n_2)$ for $\alpha$ = 0.050 and
$\alpha$ = 0.010, respectively, corresponding to $\beta$ = 0.005, 0.010, 0.025, 0.050,
0.100, 0.250, 0.500, 0.750, 0.900, 0.950, 0.975, 0.990, 0.995, for $n_1$ = 1 to 10
by steps of 1, and 12, 15, 20, 24, 30, 40, 60, 120, and $n_2$ = 1 to 10 by
steps of 1, and 12, 15, 20, 24, 30, 40, 60, 120, ∞. Interpolation for values
of $\phi(\alpha,\beta,n_1,n_2)$ corresponding to values of $n_1$ and $n_2$ between those given in
these tables should be done harmonically, that is, in terms of $1/n_1$ and $1/n_2$.
In this connection it should be noted that the sequence ∞, 120, 60, 40, 30,
24, 20 corresponds to 120/m for m = 0, 1, 2, 3, 4, 5, and 6, respectively;
and that the sequence ∞, 60, 30, 20, 15, 12, 10 corresponds to 60/r for
r = 0, 1, 2, 3, 4, 5, and 6, respectively. Accordingly, interpolation is
most conveniently carried out in terms of the arguments m and r.
Two-point harmonic interpolation will generally give $\phi$ to sufficient accuracy
for most practical purposes. For example, by 2-point harmonic
interpolation

$$\phi(0.05, 0.05, 30, 24) = \frac{\phi(0.05, 0.05, 30, 20) + \phi(0.05, 0.05, 30, 30)}{2} = 3.664 \tag{50}$$

The actual value is 3.660; customary 2-point linear interpolation gives
3.719.

* The most accurate table of $F_\alpha(n_1,n_2)$ is that of Merrington and Thompson, which
gives the values to 5 significant figures corresponding to $\alpha$ = 0.005, 0.010, 0.025,
0.050, 0.100, 0.250, and 0.500 for $n_1$ = 1 to 10 by steps of 1, and 12, 15, 20, 24, 30, 40, 60,
120, ∞, and $n_2$ = 1 to 30 by steps of 1, and 40, 60, 120, ∞. Values of $F_\alpha(n_1,n_2)$,
referred to as $e^{2z}$, corresponding to $\alpha$ = 0.001, 0.01, 0.05, 0.20, are given (mostly to
2 decimals) in Table V of Fisher and Yates, Statistical Tables, for $n_1$ = 1 to 6 by steps
of 1, and 8, 12, 24, ∞, and $n_2$ = 1 to 30 by steps of 1, and 40, 60, 120, ∞.

Similarly, by 2-point harmonic interpolation

$$\phi(0.05, 0.05, 30, 120) = \frac{\phi(0.05, 0.05, 30, 60) + \phi(0.05, 0.05, 30, \infty)}{2} = 2.618 \tag{51}$$

The actual value is 2.617. Use may be made of the fact that

$$\phi(\alpha, \beta, n_1, \infty) = \rho(\alpha, \beta, n_1) \tag{52}$$

to obtain values of the left side of this equality for values of $n_1$ included
in Tables 8.1 and 8.2 but not in Tables 8.3 and 8.4.
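
The difference between harmonic and linear interpolation in the first example above can be reproduced in a few lines. This Python sketch is illustrative only. The two end-point values 3.939 and 3.389 are assumptions: they are the values at $n_2$ = 20 and $n_2$ = 30 implied by the two interpolated results quoted in the text, since the corresponding part of Table 8.3 is not reproduced here. The function names are hypothetical.

```python
def harmonic_interp(n, n_lo, n_hi, v_lo, v_hi):
    """Two-point interpolation that is linear in 1/n (n_hi may be infinite)."""
    w = (1.0 / n - 1.0 / n_lo) / (1.0 / n_hi - 1.0 / n_lo)
    return v_lo + w * (v_hi - v_lo)

def linear_interp(n, n_lo, n_hi, v_lo, v_hi):
    """Customary two-point interpolation, linear in n."""
    w = (n - n_lo) / (n_hi - n_lo)
    return v_lo + w * (v_hi - v_lo)

# Assumed end-point values of phi(0.05, 0.05, 30, n2) at n2 = 20 and n2 = 30
phi_20, phi_30 = 3.939, 3.389

by_harmonic = harmonic_interp(24, 20, 30, phi_20, phi_30)
by_linear = linear_interp(24, 20, 30, phi_20, phi_30)

# In terms of m = 120/n2 the arguments 20, 24, 30 are m = 6, 5, 4,
# so the harmonic weight is exactly one half
weight = (1 / 24 - 1 / 20) / (1 / 30 - 1 / 20)
```

Because 24 lies midway between 20 and 30 on the scale of m = 120/n, the harmonic result is simply the mean of the two end points, which is why (50) is written as an average. Passing `float("inf")` as `n_hi` covers the second example, where the upper argument is ∞.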
CHAPTER 9


UTILIZATION OF LIMITED EXPERIMENTAL FACILITIES 
WHEN THE COST OF EACH MEASUREMENT DEPENDS 
ON ITS MAGNITUDE 

1. TYPE OF PROBLEM

A primary objective in the statistical design of experiments is to
maximize the amount of pertinent information obtained from a given
amount of time, equipment, personnel, and supplies, or, conversely, to
minimize the cost of obtaining a given amount of information. In many
experiments the cost of an observation can be regarded as a constant;
such is the case, for example, in the problem of Chap. 11. Sometimes
the cost of an observation is determined by the independent variables, as,
for example, when an independent variable is the amount of some
expensive substance or type of energy. This chapter is concerned with a third
situation, in which the cost of an observation depends on the value of
the dependent variable; a simple case occurs when the dependent variable
is determined by counting, so that the time and labor involved in a
measurement are proportional to its magnitude. Insofar as the
dependent and independent variables are correlated, the latter two situations
are essentially the same.

An interesting case in which the cost of an observation depends on 
its magnitude, one which illustrates the interplay of statistical principles 
and substantive considerations in applied statistics, occurs in metallurgical 
experiments to determine the relation between time to rupture and 
stress for a given alloy. In investigating this relation, each observation 
is obtained by subjecting a test specimen to a predetermined constant
stress and recording the amount of time that elapses before the specimen
breaks. The number of observations that can be obtained in a given 
period of time from a single piece of equipment thus depends on how long 
it takes each specimen to rupture, which in turn depends largely on the 
stress to which the test specimen is subjected. 

If the logarithm of time to rupture is a linear function of the logarithm 
of stress (and there is strong evidence that this is approximately so 
for a large number of alloys over the range of important stresses), tests 
at any two stresses suffice to estimate the line relating the two variables, 
and from this line can be predicted the time to rupture at any other 
stress. The error in such a prediction depends on the number of tests 
from which the line is estimated and on the stresses at which they are
made. But the number of tests made itself depends on the stresses at
which they are made, since the lower the stress, the longer it takes for a 
specimen to break, and hence the fewer the tests that can be made in a 
limited period with given equipment. This factor, taken by itself, would 
argue for using as high stresses as possible, in order to have many tests. 
However, the higher the stresses, the farther it is necessary to extrapolate 
the line to predict time to rupture at the stresses which are practically 
important. Further, while the percentage error in test results seems 
reasonably independent of the stress used for a fairly wide range of 
stresses, it will doubtless increase if the stress is raised sufficiently. 
Thus there is a real problem about the best stresses at which to test; 
there are opposing considerations that have to be evaluated quantitatively 
to get a definite answer. 


2. SPECIFIC PROBLEM 

It will facilitate the solution of the problem to state it more formally, 
as follows: 

Given: (1) The relation between time to rupture and stress is

$$t = \alpha + \beta s \tag{1}$$

where $t$ is the logarithm of time to rupture, $s$ is the logarithm of the
stress, and $\alpha$ and $\beta$ are constants describing the “true” (but unknown)
relationship.

(2) The number of specimens that can be tested is limited solely by 
the fact that there are a limited number of machines available and a 
limited amount of time. 

(3) The percentage error in the determination of time to rupture 
does not depend on stress; that is, if a great many determinations were 
made at each of a number of stresses, the percentage variation in the 
determinations would be the same at all stresses. This assumption will 
undoubtedly break down at sufficiently high stresses, but it seems to 
be confirmed by the available evidence for stresses ordinarily used. 

Required: (1) The stresses at which to make the tests, and (2) the 
best apportionment of the tests among these stresses, in order to estimate 
as accurately as possible (a) the slope $\beta$ of the line relating $t$ to $s$, and (b)
the time to rupture at some predetermined stress, or range of stresses, 
of practical importance, ordinarily considerably below the stresses at 
which the tests are conducted. 

3. SOLUTION 

The solution of this problem is as follows: 

(1) It is always better to use two testing stresses than to use more
than two.

(2) For purpose (b) it might sometimes be best to use only one stress
$S_0$ of practical importance. However, since this would give no
information about the time to rupture at any other stress, it would ignore purpose
(a) and is therefore excluded.

(3) Let $S_1$ and $S_2$ be the two best testing stresses ($S_2 > S_1$). Then
the best ratio of the two stresses, say, $R = S_1/S_2$, is given by solving
the following equation for $R$:

$$\frac{\beta}{2}\log_e R = 1 + R^{-\beta/2} \tag{2}$$

This is the best ratio both for estimating the slope and for estimating
the time to rupture at $S_0$, provided $S_0$ is less than $S_1$, the case of primary
interest.* The best ratio of the two testing stresses is therefore
independent of $S_0$ and depends only on $\beta$. The value of $\beta$ is, of course, to be
determined from the experiment; but some advance estimate is needed
because the relative times required for tests at two different stresses
depend on $\beta$. However, the advance estimate need not be very accurate,
since even fairly large errors in the estimate of $\beta$ will not lead to any
great loss in efficiency.

(4) Under the given assumptions, the absolute values of $S_1$ and $S_2$
cannot be determined. So long as assumption (3) of Sec. 2 is satisfied,
namely, that the percentage error is independent of stress, the higher
the testing stresses the better. In practice, therefore, $S_2$ should be
chosen as the highest testing stress for which the percentage error can
reasonably be expected to be no larger than at lower stresses. $S_1$ can
then be determined from the $R$ given by equation (2).

(5) Let $n_1$ be the number of tests to be made at $S_1$ and $n_2$ the number
to be made at $S_2$. Let $r = n_1/n_2$. The ratio $r$ is all that needs to be
determined, since the total number of tests, $n_1 + n_2$, is determined by
$S_1$ and $S_2$ and by the time and equipment available. The best value of $r$
depends on the objective. If the objective is to estimate $\beta$ as accurately
as possible, the best value of $r$ is given by

$$r = R^{-\beta/2} \tag{3}$$

If the objective is to estimate the time to rupture at $S_0$ as accurately
as possible, the best value of $r$ is given by

$$r = R^{-\beta/2}\,\frac{\log (S_0/S_2)}{\log (S_0/S_1)} \tag{4}$$

The best value of $r$ in this case depends on $S_0$. This means that it is
not possible to design the experiment so as to get the best estimate of
the time to rupture at each of a range of $S_0$'s. It is necessary to
compromise among the objectives.

* See Sec. 4.3, especially the last paragraph, for a discussion of the case where
$S_1 \le S_0$.

(6) The following numerical calculation illustrates the solution.
For one particular series of cast alloys tested at a particular elevated
temperature, $\beta$ is found to be approximately −8. If $S_2$ is taken as 30,000
pounds per square inch (p.s.i.), the best value of $S_1$ turns out to be 22,000
p.s.i. The best apportionment of the tests between the two stresses is as
follows:

Objective is to estimate as          Number of tests to be     Percentage gain in precision
accurately as possible               made at 22,000 p.s.i.     over using 0.50 tests at
                                     for each test at          25,000 p.s.i. for every test
                                     30,000 p.s.i.             at 30,000 p.s.i.

(a) Slope                            0.26                      39
(b) Time to rupture at stress of
       5,000 p.s.i.                  0.35                      42
      10,000 p.s.i.                  0.41                      45
      15,000 p.s.i.                  0.53                      47
      20,000 p.s.i.                  1.25                      55

The last column gives the gain in precision, which is equivalent to the 
savings in testing for a given degree of precision, over an alternative 
combination of stresses and relative number of observations, which 
differs only moderately from the optimum combination; it indicates the 
value of careful planning of the stresses at which to test and of the relative 
number of tests at different stresses. 
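
The steps of the solution can be traced numerically. The sketch below is a minimal Python illustration, not the authors' computation: it solves equation (2) for $R$ by bisection and then applies formulas (3) and (4). The function names `best_ratio`, `r_for_slope`, and `r_for_time` are hypothetical, the slope is assumed negative (as in the cast-alloy illustration), and the stress of 22,000 p.s.i. is used in rounded form as in the text.

```python
from math import log

def best_ratio(beta):
    """Solve (beta/2) * ln(R) = 1 + R**(-beta/2) for R in (0, 1) by bisection."""
    if beta >= 0:
        raise ValueError("this sketch assumes a negative slope beta")

    def f(R):
        return 0.5 * beta * log(R) - 1.0 - R ** (-0.5 * beta)

    lo, hi = 1e-300, 1.0           # f decreases from f(lo) to f(1) = -2
    if f(lo) <= 0.0:
        raise ValueError("no root bracketed; beta is too close to zero")
    for _ in range(200):
        mid = 0.5 * (lo + hi)
        if f(mid) > 0.0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

def r_for_slope(R, beta):
    """Equation (3): best n1/n2 when the slope is to be estimated."""
    return R ** (-0.5 * beta)

def r_for_time(R, beta, S0, S1, S2):
    """Equation (4): best n1/n2 when time to rupture at S0 is to be estimated."""
    return R ** (-0.5 * beta) * log(S0 / S2) / log(S0 / S1)

beta = -8.0
S2 = 30_000.0
R_opt = best_ratio(beta)
S1_opt = R_opt * S2                # unrounded best lower testing stress

S1 = 22_000.0                      # rounded value used in the text
R = S1 / S2
r_slope = r_for_slope(R, beta)
r_5000 = r_for_time(R, beta, 5_000.0, S1, S2)
```

`S1_opt` comes out a little below 22,000 p.s.i., in line with the rounded figure quoted in item (6), and `r_5000` is close to the 0.35 shown in the table for a stress of 5,000 p.s.i.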

4. MATHEMATICAL DEVELOPMENT 

4.1. Notation 

Let $T$ = time to rupture
    $t$ = logarithm of time to rupture
    $S$ = stress
    $S_0$ = stress for which it is desired to estimate time to rupture
    $S_1$ = smaller of two testing stresses to be used
    $S_2$ = larger of two testing stresses to be used
    $n_1$ = number of tests to be made at $S_1$
    $n_2$ = number of tests to be made at $S_2$
    $n = n_1 + n_2$
    $R = S_1/S_2$
    $r = n_1/n_2$
    $\alpha$, $\beta$ = parameters of line relating $\log T$ to $\log S$, so that

$$\log T = \alpha + \beta \log S \tag{5}$$

    $a$, $b$ = estimates of $\alpha$ and $\beta$
    $t_0$ = estimate of logarithm of time to rupture at stress $S_0$
    $K$ = total amount of machine time available
    $\sigma$ = standard deviation of observations about the line (5),
        assumed the same for all $S$
    $\sigma_b$ = standard deviation of estimated slope
    $\sigma_t$ = standard deviation of logarithm of estimated time to rupture
For convenience, take all logarithms to be natural logarithms.

We assume that the parameters of the line are estimated from the data 
by the usual least-squares procedure applied to the logarithms of the 
variables. Then, by algebraic manipulation of the usual formulas,* 



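\[ \sigma_b^2 = \frac{\sigma^2 (n_1 + n_2)}{n_1 n_2 [\log (S_1/S_2)]^2} \tag{6} \]

\[ \sigma_t^2 = \frac{\sigma^2 \{ n_1 [\log (S_0/S_1)]^2 + n_2 [\log (S_0/S_2)]^2 \}}{n_1 n_2 [\log (S_1/S_2)]^2} \tag{7} \]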


The problem is to minimize (6) and (7) by proper choice of $n_1$, $n_2$, $S_1$, 
and $S_2$, subject to the constraint that a fixed amount of machine time is 
used. Since $T = e^{\alpha} S^{\beta}$, this constraint is 

\[ n_1 e^{\alpha} S_1^{\beta} + n_2 e^{\alpha} S_2^{\beta} = K \tag{8} \]

or 

\[ n_1 S_1^{\beta} + n_2 S_2^{\beta} = K e^{-\alpha} = k \tag{9} \]

It will clarify the solution and yield useful supplementary results to 
proceed to the final solution in stages, rather than at one step. We shall, 
therefore, first assume $S_1$ and $S_2$ given and determine the optimum value 
of $r$; thereafter we shall proceed to determine the optimum value of $R$. 


4.2. Objective to Get Best Estimate of Slope 

The problem is to maximize 

\[ \frac{n_1 n_2}{n_1 + n_2} \left[ \log \frac{S_1}{S_2} \right]^2 \tag{10} \]

subject to (9), since this will minimize the variance of the slope as given 
by (6). 

Replacing $n_1$ by $r n_2$ in (9) and (10), solving (9) so modified for $n_2$, and 
substituting the result for $n_2$ in (10) gives 

\[ \frac{k r [\log (S_1/S_2)]^2}{(1 + r)(r S_1^{\beta} + S_2^{\beta})} \tag{11} \]


* See Fisher, Statistical Methods, Sec. 26, pp. 133, 135. (For full citation, see 
Sec. 5.) 
to be maximized with respect to $r$, without restraint and with $S_1$ and $S_2$ 
assumed fixed. Setting the derivative with respect to $r$ equal to zero 
gives 

\[ \frac{k [\log (S_1/S_2)]^2 (S_2^{\beta} - r^2 S_1^{\beta})}{(1 + r)^2 (r S_1^{\beta} + S_2^{\beta})^2} = 0 \tag{12} \]

A solution to (12) is obtained by setting 

\[ S_2^{\beta} - r^2 S_1^{\beta} = 0 \tag{13} \]

\[ r = \left( \frac{S_2}{S_1} \right)^{\beta/2} \tag{14} \]

where we take the positive square root, as $r$ is essentially positive. A 
check on the second derivative shows that this solution does maximize 
(11). Substituting (14) into (11) and taking the reciprocal gives 

\[ \frac{\sigma_b^2}{\sigma^2} = \frac{(S_1^{\beta/2} + S_2^{\beta/2})^2}{k [\log (S_1/S_2)]^2} \tag{15} \]

as the minimum ratio of the variance of the estimated slope to the 
variance of an individual observation, given $S_1$ and $S_2$. 

Replacing $S_1$ by $R S_2$ in (15) gives 

\[ \frac{\sigma_b^2}{\sigma^2} = \frac{S_2^{\beta} (R^{\beta/2} + 1)^2}{k (\log R)^2} \tag{16} \]

Consider $S_2$ as fixed and minimize (16) with respect to $R$; this gives 

\[ \frac{d(\sigma_b^2/\sigma^2)}{dR} = \frac{2 S_2^{\beta} (R^{\beta/2} + 1)}{k R (\log R)^3} \left[ \frac{\beta}{2} R^{\beta/2} \log R - 1 - R^{\beta/2} \right] = 0 \tag{17} \]

whence 

\[ \frac{\beta}{2} \log R = 1 + R^{-\beta/2} \tag{18} \]

A check on the second derivative shows that this solution does minimize 
(16). 

Substituting the value for $\log R$ from (18) in (16) gives 

\[ \frac{\sigma_b^2}{\sigma^2} = \frac{\beta^2 R^{\beta} S_2^{\beta}}{4k} \tag{19} \]
as the minimum value of (16), given $S_2$. Since $\beta$ is negative, it is clear 
that the larger $S_2$, the smaller the variance; hence there is no finite 
optimum value of $S_2$. In practice, increases in $S_2$ beyond some point will 
increase $\sigma^2$, and the optimum value of $S_2$ will be somewhere near this 
point.* 
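
As a rough numerical check on (14) and (18), the following sketch solves (18) by bisection. It rests on a rearrangement that the text does not make: writing $u = R^{\beta/2}$, condition (18) becomes $\log u = 1 + 1/u$, which no longer involves $\beta$, so that by (14) the optimum $r = 1/u$ would be the same for every negative $\beta$. The function names and the value $\beta = -8$ are assumptions chosen only for illustration.

```python
import math


def solve_u(tol=1e-12):
    """Solve log(u) = 1 + 1/u for u > 1 by bisection.

    With u = R**(beta/2) this is condition (18) rewritten; the
    substitution is our own rearrangement, not a formula from the text.
    """
    lo, hi = 1.0, 10.0  # g(1) = -2 < 0, g(10) > 0, and g is increasing
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if math.log(mid) - 1.0 - 1.0 / mid < 0.0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


def optimum_design_for_slope(beta):
    """Return (R, r): stress ratio S1/S2 from (18) and test ratio n1/n2 from (14)."""
    if beta >= 0:
        raise ValueError("beta must be negative")
    u = solve_u()
    R = u ** (2.0 / beta)  # because u = R**(beta/2)
    r = 1.0 / u            # r = (S2/S1)**(beta/2) = R**(-beta/2)
    return R, r


beta = -8.0  # hypothetical slope, for illustration only
R, r = optimum_design_for_slope(beta)
print(f"R = {R:.4f}, r = {r:.4f}")
```

Under this assumed slope, multiplying the printed $R$ by a chosen $S_2$ gives the corresponding lower testing stress $S_1$.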


4.3. Objective to Get Best Estimate of Time to Rupture at a Given Stress 

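The problem here is to minimize (7) subject to (9). Replacing $n_1$ by 
$r n_2$ in (7) and (9), solving (9) so modified for $n_2$, and substituting the 
result in (7) gives 

\[ \frac{\sigma_t^2}{\sigma^2} = \frac{(r S_1^{\beta} + S_2^{\beta}) \{ r [\log (S_0/S_1)]^2 + [\log (S_0/S_2)]^2 \}}{k r [\log (S_1/S_2)]^2} \tag{20} \]

to be minimized with respect to $r$, without restraint and with $S_1$ and $S_2$ 
assumed fixed. Setting the derivative of (20) with respect to $r$ equal to 
zero gives 

\[ \frac{S_1^{\beta} [\log (S_0/S_1)]^2 - S_2^{\beta} [\log (S_0/S_2)]^2 / r^2}{k [\log (S_1/S_2)]^2} = 0 \tag{21} \]

From (21) the minimizing solution is given by 

\[ r^2 = \left( \frac{S_2}{S_1} \right)^{\beta} \left[ \frac{\log (S_0/S_2)}{\log (S_0/S_1)} \right]^2 \tag{22} \]

\[ r = \pm \left( \frac{S_2}{S_1} \right)^{\beta/2} \frac{\log (S_0/S_2)}{\log (S_0/S_1)} \tag{23} \]

The positive sign is required to make $r$ positive if $S_0$ is less than $S_1$ or 
greater than $S_2$; the negative sign, if $S_1 < S_0 < S_2$. The second derivative 
shows that these signs also give minima. Since we are concerned 
primarily with values of $S_0$ that are below $S_1$, we shall hereafter consider 
only the positive sign. 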
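
The positive-sign solution can be evaluated directly. The sketch below does so for testing stresses of 22,000 and 30,000 p.s.i., as in the numerical example of this chapter. It uses the ratio obtained by minimizing (20) with respect to $r$, namely $r = (S_2/S_1)^{\beta/2} \log(S_0/S_2) / \log(S_0/S_1)$. The function name and the value $\beta = -8$ are assumptions made for illustration, since the slope underlying the example is not stated in this section.

```python
import math


def optimum_r_for_time(S0, S1, S2, beta):
    """Ratio n1/n2 minimizing the variance of the estimated log rupture
    time at stress S0 (positive-sign solution, valid for S0 < S1 < S2)."""
    if not (0 < S0 < S1 < S2):
        raise ValueError("expected 0 < S0 < S1 < S2")
    return (S2 / S1) ** (beta / 2.0) * math.log(S0 / S2) / math.log(S0 / S1)


beta = -8.0  # hypothetical slope of log T on log S
for S0 in (5_000, 10_000, 15_000, 20_000):
    r = optimum_r_for_time(S0, 22_000, 30_000, beta)
    print(f"S0 = {S0:>6} p.s.i.: r = {r:.2f}")
```

The ratio grows as $S_0$ approaches $S_1$, so proportionally more tests go to the lower stress when the stress of interest is close to it.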

* It will pay to go somewhat beyond the point at which $\sigma^2$ starts to increase, since 
the loss from the increase in $\sigma^2$ will for a time be less than the gain from the increase 
in $S_2$. 
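Substituting (23) into (20) gives as the minimum ratio of the variance 
of the estimated rupture time to the variance of an individual observation, 
with $S_1$ and $S_2$ fixed, 

\[ \frac{\sigma_t^2}{\sigma^2} = \frac{[S_1^{\beta/2} \log (S_0/S_2) + S_2^{\beta/2} \log (S_0/S_1)]^2}{k [\log (S_1/S_2)]^2} \tag{24} \]

On replacing $S_1$ by $R S_2$, (24) becomes 

\[ \frac{\sigma_t^2}{\sigma^2} = \frac{S_2^{\beta}}{k} \left[ \frac{(1 + R^{\beta/2}) \log (S_0/S_2)}{\log R} - 1 \right]^2 \tag{25} \]

from which, holding $S_2$ fixed, 

\[ \frac{d(\sigma_t^2/\sigma^2)}{dR} = \frac{2 S_2^{\beta}}{k} \left[ \frac{(1 + R^{\beta/2}) \log (S_0/S_2)}{\log R} - 1 \right] \left[ \frac{\left( \frac{\beta}{2} R^{\beta/2} \log R - 1 - R^{\beta/2} \right) \log (S_0/S_2)}{R (\log R)^2} \right] = 0 \tag{26} \]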

For $S_0$ less than $S_1$, there is no value of $R$ that will make the first bracket 
equal to zero. Accordingly, the minimizing value of $R$ is given by 

\[ \frac{\beta}{2} R^{\beta/2} \log R - 1 - R^{\beta/2} = 0 \tag{27} \]

or 

\[ \frac{\beta}{2} \log R = 1 + R^{-\beta/2} \tag{28} \]

which is the same as (18), the solution for minimizing the variance of 
the slope. This is, of course, the minimizing value only if $S_2$ is sufficiently 
large so that the derived $S_1$ is larger than $S_0$. 

By differentiating (25) with respect to $S_2$, it can be shown that (25) 
decreases as $S_2$ increases, so that there is no finite optimum value of $S_2$. 

If $S_2$ is chosen greater than $S_0$, while $S_1$ derived from the $R$ satisfying 
(28) is less than $S_0$, (28) no longer gives the optimum solution. In that 
case, it turns out that the variance is a monotonically decreasing function 
of $R$. The best that can be done is to take the largest possible value of 
$R$, namely, $R = S_0/S_2$, or $S_1 = S_0$. This solution using two testing 
stresses will not, however, be as good as that putting all the tests at $S_0$. 

5. REFERENCE 

Fisher, R. A.: Statistical Methods for Research Workers, 4th and later ed., Oliver and 
Boyd, Ltd., Edinburgh and London, 1932-1944. 
CHAPTER 10 


EXPERIMENTS IN WHICH THE INDEPENDENT VARIABLE 
IS KNOWN ONLY IN TERMS OF A 
PROBABILITY DISTRIBUTION 

1. PROBLEM 

1.1. Type of Problem 

This chapter deals with a type of problem in which it is desired to 
subject each of a number of items to a test of at least a given severity, 
but the severity of any one test cannot be measured and cannot be controlled 
precisely. Thus it is necessary to make an independent study of 
the probability that a test will attain the specified severity and then to 
determine the number of tests necessary for each item in order to attain 
a given probability that at least one test will be of the specified severity. 
The particular problem discussed is relatively simple, and the chapter is 
intended more to suggest an interesting and in some respects important 
type of problem and to indicate an approach to it, than to present a 
solution or a novel application of statistical techniques. 

1.2. Specific Problem 

A gun barrel is considered fit for service with respect to its pressure- 
resisting properties if it has fired a round such that the gas pressure 
developed is not less than a prescribed pressure $K$, $K$ being greater than 
any pressure that may reasonably occur in service. Special ammunition 
that develops high pressures is used for proofing the barrels. In order to 
measure the pressure developed by a round, a hole must be bored in the 
barrel, which renders it unfit for service. Consequently, it is impossible 
to ascertain with certainty that a barrel intended for service has actually 
been proofed. It is, however, possible to draw conclusions about the 
percentage of barrels proofed from a study of the characteristics of the 
special ammunition. When a lot of the latter is delivered, a small fixed 
sample of size n is tested for pressure and the remainder (which is almost 
the whole lot) is used for proofing, N rounds being fired in each barrel 
to be proofed. Manufacture of the special ammunition is in a sufficiently 
satisfactory state that the proportion of defective rounds, that is, those 
which generate a pressure less than K , is usually very small (of the order 
of 0.001). The problem is, on the basis of the results from the sample of 
n, (1) to set up criteria for accepting or rejecting the lot, and (2) to 
determine N, the number of rounds to be fired in each barrel. 

2. SOLUTION 

Choose $\gamma$, a confidence coefficient with which to operate, the minimum 
number of barrels that an acceptable lot should proof, and the minimum 
tolerable probability of proofing a barrel. From the sample of $n$ obtain 
an estimate $p_\gamma$, with confidence coefficient $\gamma$, of an upper bound on the 
proportion of rounds in the lot which, when fired, will generate a pressure 
less than $K$. How this estimate is to be obtained is described in 
Sec. 3. Construct a table as follows: 

Number of rounds (N)        With confidence coefficient $\gamma$,
fired in each barrel        the probability that a barrel
                            will be proofed is at least

        1                           $1 - p_\gamma$
        2                           $1 - p_\gamma^2$
        .                           .
        .                           .
        i                           $1 - p_\gamma^i$


From the table select the smallest $N$ yielding the required probability of 
proofing a barrel. If this $N$ is too large to enable the lot of ammunition 
to proof the minimum required number of barrels, reject the lot. Otherwise, 
fire $N$ rounds in each barrel. 

[Fig. 10.2. Ninety-five percent confidence belts for population proportion. Ordinate: population proportion; abscissa: observed proportion (x/n).] 


3. CONFIDENCE LIMITS FOR PROPORTION DEFECTIVE 


3.1. Attributes 

If the upper confidence limit $p_\gamma$ is determined by attributes, that is, by 
classifying each round in the sample of n as above K or below K , it is 
unnecessary to make any assumption whatever about the distribution 
of the pressures of the bullets of the lot. It is only necessary that the 
sample of n be selected at random. If it is believed that defective 
rounds do not occur independently (for example, if a machine functions 
improperly, all the bullets it produces may be defective and these may be 
adjacent in the lot), the N bullets to be fired from any gun barrel should 
be chosen at random from the lot. 

Perhaps the most accurate convenient method of finding $p_\gamma$ is with 
the aid of Thompson's table of percentage points of the incomplete 
beta function. Let 

    $c$ = number of rounds in the sample of $n$
          whose pressures are less than $K$        (1)
    $a = 100(1 - \gamma)$                          (2)
    $\nu_1 = 2(c + 1)$                             (3)
    $\nu_2 = 2(n - c)$                             (4)

Enter the table with $\nu_1$ and $\nu_2$ on the $a$ percentage-point page and find $x$. 
Then $p_\gamma = 1 - x$. For example, if $\gamma = 0.975$, $c = 0$, $n = 30$, we have 
$a = 2.5$, $\nu_1 = 2$, $\nu_2 = 60$; from the table we find $x = 0.88430$, or 
$p_{0.975} = 0.1157$. The table of Sec. 2 then appears as follows: 

          Minimum
    N     probability
    1     0.8843
    2     0.9866
    3     0.9985
    etc.

[Fig. 10.3. Ninety percent confidence belts for population proportion. Ordinate: population proportion (P); abscissa: observed proportion (x/n).] 
[Fig. 10.4. Eighty percent confidence belts for population proportion. Ordinate: population proportion (P).] 

Note: When $c = 0$, $p_\gamma = 1 - (1 - \gamma)^{1/n}$. 
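
For the case $c = 0$ covered by the note above, the limit and the table of Sec. 2 can be computed without reference to the incomplete beta function tables. The following is a minimal sketch under that restriction; the function names are illustrative and do not come from the text.

```python
def upper_limit_zero_defects(n, gamma):
    """Upper confidence limit on the proportion defective when the
    sample of n contains no defectives (c = 0)."""
    return 1.0 - (1.0 - gamma) ** (1.0 / n)


def proofing_table(p_gamma, max_rounds):
    """Rows (N, minimum probability that a barrel is proofed when N rounds are fired)."""
    return [(N, 1.0 - p_gamma ** N) for N in range(1, max_rounds + 1)]


p = upper_limit_zero_defects(n=30, gamma=0.975)
print(f"p = {p:.4f}")  # 0.1157, as in the worked example
for N, prob in proofing_table(p, 3):
    print(N, f"{prob:.4f}")  # 0.8843, 0.9866, 0.9985
```

For $c > 0$ the closed form no longer applies and a percentage point of the incomplete beta function is needed, as described in the text.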

When $\gamma$ = 0.995, 0.975, 0.95, or 0.90, $p_\gamma$ may be found from Figs. 
10.1, 10.2, 10.3, or 10.4, respectively.* These show belts within which we may assert, 

* Figures 10.1 and 10.2 are reproduced from Pearson and Clopper, "Confidence 
Limits," by permission of Biometrika. Figures 10.3 and 10.4 are based on computations 
from Thompson's "Incomplete Beta Function" made (in cooperation with the 
Statistical Research Group) by R. L. Deckinger of the Bureau of Ordnance, U.S. Navy, 
and furnished through the courtesy of R. H. Park. (For full citations, see Sec. 5.) 

With regard to Figs. 10.1 and 10.2, Professor E. S. Pearson writes in part: 
"Actually I believe they are not completely accurate, as the levels at certain points 
were obtained by interpolation and not by direct calculation from the 'Incomplete 
Beta Function.' If we reproduce such diagrams ourselves, we shall have to look into 
this point." (Letter of January 16, 1946, to one of the editors.) 

For Figs. 10.3 and 10.4, abscissas were computed for ordinates between 0 and 1 by 
with confidence coefficient of, say, $\Gamma$, that the true proportion lies when 
the observed proportion is $x/n$. For the present purposes $\gamma$ is related 
to $\Gamma$ by 

\[ \Gamma = 2\gamma - 1 \tag{5} \]

Thus, for the example of the preceding paragraph (where $\gamma = 0.975$), 
$\Gamma = 0.95$. From Fig. 10.2 we see that the upper curve for $n = 30$ cuts 
the axis $x/n = 0$ at about 0.11 or 0.12. 

Still another method of finding $p_\gamma$ is by means of the arcsin transformation 
described in Chap. 16. Thus, with angles in radians, 

\[ p_\gamma = \sin^2 \left[ \arcsin \sqrt{c/n} + \frac{K_{1-\gamma}}{2\sqrt{n}} \right] \tag{6} \]

where $K_{1-\gamma}$ is the normal deviate exceeded with probability $1 - \gamma$ (see 
Table 1.1 of Chap. 1). This formula is not satisfactory, however, when 
$c/n$ is very small, especially not when $c = 0$. 

It should be noted that the foregoing assumes the lot to be large 
enough relative to the sample so that the effect of lot size can be ignored; 
this will almost always be the case in practice. 

3.2. Variables 

If it can be assumed that the gas pressures of the bullets in the lot are 
random independent observations on a normally distributed variate, $p_\gamma$ can 
be determined by the method described in Chap. 1, Sec. 8.2 (pp. 42-45).* 
The value of $\Gamma$ given by equation (5) of this chapter is used as the value 
of $\gamma$ in Chap. 1; in other words, $L_p$ of Chap. 1 corresponds with $1 - \gamma$ of 
this chapter. 

4. SIZE OF PRELIMINARY SAMPLE 

While the rounds expended in testing the initial sample of n do not 
serve directly to proof machine-gun barrels, it would be an error to 

steps of 0.1, using Thompson's "Incomplete Beta Function." For a confidence 
coefficient $\Gamma$ (in Fig. 10.3, $\Gamma = 0.90$; in Fig. 10.4, $\Gamma = 0.80$), the upper (lower) abscissa 
for a given value of $n$ represents the value of $x/n$ that will be exceeded (fallen short of) 
with probability $(1 - \Gamma)/2$ in samples of size $n$ from a binomial population with 
parameter $P$. A smooth curve was passed through the 11 points, but is shown on 
the chart only over the range $1/n' < x/n < 1 - 1/n'$, where $n'$ is the next sample 
size above $n$ shown on the chart. Outside this range the curve was bent so that at 
abscissas 0 and 1 the ordinates are $1 - (1 - \Gamma)^{1/n}$ and $(1 - \Gamma)^{1/n}$, respectively, 
instead of $1 - [(1 - \Gamma)/2]^{1/n}$ and $[(1 - \Gamma)/2]^{1/n}$. The reason for the difference in 
computation at $x/n = 0$ and 1 is that there the confidence interval must lie entirely 
to one side of the observed proportion. In deciding where to bend the curve, it was 
assumed that a given curve will never be used in the intervals $0 < x/n < 1/n'$ and 
$1 - 1/n' < x/n < 1$, and that only for interpolation will it ever be used in the intervals 
$1/n' < x/n < 1/n$ and $1 - 1/n < x/n < 1 - 1/n'$. 

* A slightly more accurate method is given in Chap. 1, Sec. 11.4.6, Problem II, 
where % corresponds with $1 - \gamma$ of this chapter (see pp. 76-77). 
characterize them as wasted and to try to keep $n$ as small as possible. 
Actually these rounds supply information about the quality of the ammunition, 
and more information can easily result in an actual saving of 
ammunition. Suppose, for example, that a lot contains 10,000 rounds, of 
which (unknown to us) 10 (0.1 percent) are defective, that we wish to 
operate with a 0.99 confidence coefficient, and that the minimum tolerable 
probability of proofing a barrel is to be 0.99. Suppose further that in a 
sample of size 20 ($n = 20$) no defectives appear ($c = 0$). Proceeding as 
in Sec. 3.1, we construct the following table: 

    $n = 20$         $\gamma = 0.99$          $c = 0$
    $a = 1$          $\nu_1 = 2$              $\nu_2 = 40$
    $x = 0.79433$    $p_{0.99} = 0.20567$

          Minimum
    N     probability
    1     0.79
    2     0.96
    3     0.99+

Three rounds are needed for each barrel, so the total number of barrels 
proofed by the lot will be 

\[ \frac{10{,}000 - 20}{3} = 3{,}326 \tag{7} \]

Now, suppose that a sample of 60 ($n = 60$) had been taken and no 
defectives found. Then 

    $n = 60$         $\gamma = 0.99$          $c = 0$
    $a = 1$          $\nu_1 = 2$              $\nu_2 = 120$
    $x = 0.92612$    $p_{0.99} = 0.07388$

          Minimum
    N     probability
    1     0.93
    2     0.99+

Two rounds suffice for each barrel, and the total number of barrels proofed 
by the lot would be 

\[ \frac{10{,}000 - 60}{2} = 4{,}970 \tag{8} \]

a 49 percent increase over the previous result. 
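
The comparison of the two sample sizes can be reproduced with a few lines of arithmetic. The sketch below assumes, as in the example, that no defectives are found in the sample, so that the closed form for $c = 0$ applies; the function names are illustrative only.

```python
def upper_limit_zero_defects(n, gamma):
    """Upper confidence limit on the proportion defective when c = 0."""
    return 1.0 - (1.0 - gamma) ** (1.0 / n)


def rounds_per_barrel(p_gamma, required):
    """Smallest N for which 1 - p_gamma**N reaches the required probability."""
    N = 1
    while 1.0 - p_gamma ** N < required:
        N += 1
    return N


def barrels_proofed(lot_size, n, gamma, required):
    """Return (N, number of barrels the remainder of the lot can proof)."""
    p = upper_limit_zero_defects(n, gamma)
    N = rounds_per_barrel(p, required)
    return N, (lot_size - n) // N


for n in (20, 60):
    N, barrels = barrels_proofed(10_000, n, gamma=0.99, required=0.99)
    print(f"n = {n}: N = {N}, barrels proofed = {barrels}")
# n = 20 gives N = 3 and 3326 barrels; n = 60 gives N = 2 and 4970 barrels
```

Trying other values of $n$ in the same way shows where a larger sample stops paying for itself, always on the assumption that the sample turns up no defectives.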

Of course the number of defectives in the sample of n is a chance 
variable whose distribution depends on n, among other parameters. On 
the basis of previous experience with the ammunition being used, it is 
possible to determine that value of n which is optimum in the sense that 
if the quality of the ammunition is essentially unchanged with time, the 
average number of machine-gun barrels successfully proofed will be 
maximized. 
CHAPTER 11 


PLANNING AN EXPERIMENT FOR ESTIMATING THE MEAN 
AND STANDARD DEVIATION OF A NORMAL DISTRIBUTION 
FROM OBSERVATIONS ON THE CUMULATIVE DISTRIBUTION 

1. THE PROBLEM 

A class of statistical problems of frequent occurrence arises from situations 
in which a characteristic is conceived to be normally distributed 
among the individuals of a population, but measurement of any one 
individual can show only whether the characteristic is above or below a 
certain level. The most familiar form of this problem concerns the 
lethality of varying doses of drugs. The electric impulse necessary to 
ignite a charge of powder in a certain system represents another such 
problem; for, although in this case it is possible to apply increasing 
impulses until ignition occurs, there is some evidence that an impulse 
that fails to cause ignition may alter the sensitivity of the system. In 
such situations a single stimulus of known intensity is applied to each 
individual. The resulting data show the proportions of the individuals 
tested at various intensities whose sensitivity fell below those intensities; 
in other words, the data for each level of stimulus intensity provide one 
observation on the cumulated distribution of sensitivities. These observations 
may be analyzed to estimate the mean and standard deviation 
of the sensitivity of the individuals comprising the population. 

The present chapter is concerned with certain aspects of the planning 
of experiments that yield observations on the cumulated distribution 
and are analyzed to determine the mean and standard deviation of the 
distribution. For concreteness, the discussion is given in terms of the 
specific problem of planning an experiment to study the sensitivity of a 
radio proximity fuze. A radio proximity fuze, as its name suggests, 
makes use of radio waves to explode a projectile when and if it comes into 
proximity with a target; unlike time fuzes or point-detonating fuzes, it 
requires neither that a definite time of flight be set in advance nor that 
the target be hit directly. If proximity fuzes of a given model behaved 
identically there would, for fixed conditions, be a definite distance from 
the target within which all fuzes would operate and beyond which none 
would operate. Actually, there are random variations in the fuzes and 
in the impulses returned by the target; so if a number of fuzes pass exactly 
the same distance from a target under the same conditions, some operate 
and some do not. An important parameter of fuze performance is, 
therefore, the probability that a fuze will operate if it passes within a 
certain distance of the target. 

2. METHOD OF ANALYSIS 

It has been determined from a certain set of 1,211 test firings made 
under relatively homogeneous conditions that the probability that a 
nondefective fuze will operate if it passes within a certain distance of the 
target can be well represented by assuming (a) that each nondefective 
fuze is characterized by a definite sensitivity, or maximum distance at 
which it will function, and ( b ) that these sensitivities are normally 
distributed among fuzes. 

Let $S$ be the sensitivity of a fuze, defined as the maximum distance at 
        which that fuze fired under the given circumstances will 
        function 
    $\mu$ be the true (population) arithmetic mean sensitivity 
    $\sigma$ be the true (population) standard deviation of sensitivity 
Then under the above assumptions the probability that a fuze will have 
a sensitivity between $S$ and $S + dS$ is 

\[ \frac{1}{\sigma \sqrt{2\pi}} e^{-\frac{(S - \mu)^2}{2\sigma^2}} \, dS \tag{1} \]

The proportion $T(d)$ of fuzes passing a given distance $d$ from the target 
that operate is then the integral of (1) from $d$ to $\infty$, since every fuze 
whose sensitivity is greater than $d$ will operate at $d$, while every fuze 
whose sensitivity is less will not. Therefore, 

\[ T(d) = \frac{1}{\sigma \sqrt{2\pi}} \int_d^{\infty} e^{-\frac{(S - \mu)^2}{2\sigma^2}} \, dS \tag{2} \]

In firing trials, it is not possible to observe the sensitivity of a fuze. 
Each fuze can be fired only once; all that can be observed is the minimum 
distance from the target and whether the fuze operated in proximity to 
the target.* The observations obtained are therefore observations on 
$T(d)$, the integral of the normal curve, and not directly on $S$. 

Methods for analyzing such data have been developed, primarily in 
connection with dosage-mortality curves.† These methods involve writing 
(2) as 

\[ T(d) = \frac{1}{\sqrt{2\pi}} \int_Y^{\infty} e^{-\frac{x^2}{2}} \, dx \tag{3} \]

where 

\[ Y = \frac{d - \mu}{\sigma} \tag{4} \]

* Actually, the entire trajectory of the fuze can be observed and also the actual 
point where the fuze operated. However, as these play no role in the problem considered 
here, they are ignored. 

† See Bliss, "The Calculation of the Dosage-mortality Curve," "The Comparison 
of Dosage-mortality Data," "Estimating the Dosage-mortality Curve," and "The 
Calculation of the Time Mortality Curve"; also Fisher and Yates, Tables, pp. 8-12. 
(For full citations, see Sec. 6.) 

An observed value of $T(d)$ can be converted into an estimate of $Y$ by 
means of tables of the normal probability integral.* The estimate of $Y$ 
may be denoted by $y$. Now, $Y$ is linearly related to $d$ by 

\[ Y = -\frac{\mu}{\sigma} + \frac{1}{\sigma} d \tag{5} \]

Hence the observed data may be analyzed by fitting a straight line to 
the observed values $(y, d)$. The usual direct methods of fitting a straight 
line cannot be used, however, since the variance of $y$ is not a constant but 
depends on $Y$; instead, a method of successive approximations is used. 
Tables are available to facilitate fitting the straight line by this method.† 
To avoid negative values, some of these tables use "probits," defined as 
normal deviates increased by 5, instead of the normal deviates themselves. 
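
The table lookup described above can be imitated with the inverse normal distribution function in Python's standard library. The sketch assumes the convention that $T(d)$ is the probability that a standard normal deviate exceeds $Y$, so that $y$ is the deviate exceeded with probability equal to the observed proportion operating; the function names are illustrative.

```python
from statistics import NormalDist

_STANDARD_NORMAL = NormalDist()  # mean 0, standard deviation 1


def normal_deviate(proportion_operating):
    """Deviate y exceeded by a standard normal variate with the given probability."""
    if not 0.0 < proportion_operating < 1.0:
        raise ValueError("proportion must lie strictly between 0 and 1")
    return _STANDARD_NORMAL.inv_cdf(1.0 - proportion_operating)


def probit(proportion_operating):
    """Normal deviate increased by 5, as used in some published tables."""
    return normal_deviate(proportion_operating) + 5.0


for observed in (0.9, 0.5, 0.1):
    print(observed, round(normal_deviate(observed), 3), round(probit(observed), 3))
```

Observed proportions of exactly 0 or 1 have no finite deviate, which is one reason the fitting proceeds by weighted successive approximations rather than by a direct fit to the converted values.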

The function (5) was fitted to the data observed in the 1,211 test 
firings on fuze performance mentioned previously. The estimates of $\mu$ 
and $\sigma$ turned out to be $A$ and $B$, respectively.‡ Figure 11.1 shows the 
fitted function (5), as well as 90 percent confidence limits on it. These 
confidence limits were computed from the equation§ 

\[ \sigma_Y = s \sqrt{ \frac{1}{\sum_{i=1}^{N} w_i} + \frac{(d - \bar{d})^2}{\sum_{i=1}^{N} w_i (d_i - \bar{d})^2} } \tag{6} \]

where $N$ = number of observed points to which the straight line was 
        fitted 
    $w_i$ = weight given to the $i$th point in getting the last approximation 
    $\bar{d}$ = weighted average of observed values of $d$, that is, 
        $\bar{d} = \sum_{i=1}^{N} w_i d_i \big/ \sum_{i=1}^{N} w_i$ 
    $s^2$ = estimated variance of an observation about the last approximation, 
        that is, if $Y_i$ is the regression value for the $i$th point 
        computed from the last approximation, then 

\[ s^2 = \frac{\sum_{i=1}^{N} w_i (y_i - Y_i)^2}{N - 2} \tag{7} \]

* For example, Table 1.5 in Chap. 1 of this volume. 

† For example, Fisher and Yates, Tables, p. 49. Another useful set of tables is 
contained in Brown, Tables to Facilitate the Analysis of Sensitivity Data. This shows 
the ordinate $Z$ of a normal curve, the probability $P$ that a given normal deviate will 
be exceeded, $Q = 1 - P$, $Z/PQ$, and $Z^2/PQ$, all to 5 decimal places for values of $Y$ 
from 0 to 3 by steps of 0.01. 

‡ The numerical values of $A$ and $B$, which cannot be published, are immaterial 
for the present purposes. 

§ See Bliss, "The Calculation of the Dosage-mortality Curve." 


The problem then arose of collecting similar data under different 
conditions. Theoretical considerations gave little indication of the way 
in which the performance of the fuze would be affected by the difference 
in conditions. The scale of the experiment had to be limited to at most 
250 rounds. The statistical problem, with which the rest of this chapter 
deals, is that of distributing these rounds among various possible distances 
from the target in such a way as to get as accurate an estimate as possible 
of $T(d)$. This problem serves to illustrate a number of considerations 
pertinent to such experiments; while no general, theoretical solution is 
offered, the mode of approach should be at least suggestive in similar 
situations. 

[Fig. 11.1. Fitted function with 90 percent confidence limits based on 1,211 observations. Axis label: normal deviate.] 

3. STATISTICAL CONSIDERATIONS AFFECTING THE PLANNING 
OF THE EXPERIMENT 

A paradox frequently arises in planning experiments. The most efficient 
design depends on the true value of the parameter under investigation, 
but if this were known there would be no need for the experiment. 
This paradox is usually evaded by basing the experimental design on some 
advance estimate of the parameters and trying to adjust the design so 
that the efficiency of the experiment is not much affected by minor 
differences between what is true and what is assumed. This method 
itself raises difficulties, for example, how much of the efficiency that the 
optimum design would have if the advance estimate were correct should 
be sacrificed to gain efficiency if the advance estimate is not correct? 

This difficulty is present in marked degree in the problem under 
consideration and is aggravated by another difficulty, namely, that two 
parameters are to be estimated and the experimental design that is best 
for estimating one is not best for estimating the other. The two parameters 
to be estimated are $\mu$ and $\sigma$, or, what is equivalent, the slope and 
intercept of (5). The experiment would be most efficient if it could 
minimize both the variance of the estimate of $\mu$ and that of the estimate 
of $\sigma$; unfortunately, the distribution of observations that minimizes the 
variance of the estimate of $\mu$ is decidedly different from that which 
minimizes the variance of the estimate of $\sigma$. 

If $\sigma$ were known, the variance of the estimate of $\mu$ would be minimized 
by taking all observations at a single distance equal to $\mu$. But since $\sigma$ is 
not known, it would be self-defeating to take all observations at a single 
distance, because (a) this would make it impossible to correct the advance 
estimate of $\mu$, and (b) it would give no information at all about $\sigma$. If the 
observed value of $T(d)$ were 50 percent, the estimate of $\mu$ from the experiment 
would be equal to the value of $d$ at which the firings were made, 
that is, it would be the same as the advance estimate. If the observed 
value of $T(d)$ were other than 50 percent, the experiment would indicate 
that the advance estimate of $\mu$ was wrong but no new estimate could be 
made. Only one point on the straight line (5) would be available, so its 
slope could not be estimated; hence, it would be impossible to estimate the 
value of $d$ at which $T(d)$ = 50 percent. These statements are equivalent, 
of course, to saying that no estimate of $\sigma$ would be possible either. The 
most that could be learned from an experiment with all observations at an 
estimated $\mu$ would be whether this estimate was correct. 

The variance of the estimate of $\sigma$ would be minimized by taking all 
observations at two values of $d$, approximately 1.6 times the estimated 
value of $\sigma$ on either side of the estimated value of $\mu$. Even though this 
would not give as accurate an estimate of $\mu$ as could be obtained by 
placing the observations closer to $\mu$, it is a workable arrangement in that 
it would yield estimates of both $\mu$ and $\sigma$ even though the advance estimates 
were in error. 
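
The figure of about 1.6 can be checked numerically. The sketch below assumes the usual probit weight $Z^2/PQ$ per observation and assumes that, for equal numbers of observations at $\mu \pm y\sigma$, the variance of the estimated slope (and hence, approximately, of the estimated $\sigma$) is inversely proportional to $y^2 Z^2/PQ$. Both the criterion and the function names are stated here as assumptions rather than taken from the text.

```python
from statistics import NormalDist

_STANDARD_NORMAL = NormalDist()


def probit_weight(y):
    """Weight Z**2 / (P * Q) of an observation taken at normal deviate y."""
    z = _STANDARD_NORMAL.pdf(y)
    p = _STANDARD_NORMAL.cdf(y)
    return z * z / (p * (1.0 - p))


def slope_information(y):
    """Quantity proportional to the information on the slope from points at +/- y."""
    return y * y * probit_weight(y)


def best_spacing(step=0.001, upper=3.0):
    """Grid search for the deviate y that maximizes the slope information."""
    candidates = (i * step for i in range(1, int(upper / step) + 1))
    return max(candidates, key=slope_information)


print(f"optimum spacing is about {best_spacing():.2f} standard deviations")
```

The maximum is rather flat, so moderate departures from the optimum spacing cost little precision in the estimate of $\sigma$.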

From these considerations alone, the appropriate compromise would 
appear to be to place the observations in two groups located symmetrically 
about the estimated $\mu$ but closer to it than the distance that would 
minimize the variance of the estimate of $\sigma$. To decide how much closer 
would require deciding how much it is worth sacrificing accuracy in $\sigma$ to 
gain accuracy in $\mu$. 

Other considerations must, however, be taken into account. Placing 
all the observations at two points puts great reliance on the validity of 
equation (2) as a description of T(d). This relation was derived from a 
single body of data, for which its validity cannot be considered conclusively 
established; it may, therefore, not be valid for the new experiment. 
Observations at only two values of d cannot, however, show whether it is 
valid or not, since there will be only two points to which to fit (5), and a 
straight line can always be fitted perfectly to two points. In order to 
determine whether (2), and hence (5), describe the data reasonably well, 
it is necessary to spread the observations among more values of d. The 
desirability of spreading the observations was reinforced in the present 
experiment by the fact that the data were to be used for other purposes, 
in particular, to study the location of points of functioning, for which a 
wide spread seemed desirable. 

Other considerations to be taken into account in the particular experiment 
were (1) it was not feasible to use distances greater than about 
A + %B) (2) most of the firings in the earlier experiment had been at 
short distances, so that it seemed desirable to have some firings in the new 
experiment at short distances in order to make a direct comparison of the 
results of the two experiments, independent of any assumptions about the 
form of T(d). 


4. POSSIBLE DESIGNS 

In the light of the preceding considerations, five alternative distributions 
of 250 observations were considered. These are shown in Table 11.1. 

Plans 1 and 2 are suggested by considering how to minimize the 
variance of the estimate of $\sigma$. The advance estimates, based on the 
previous experiment, were $A$ for $\mu$ and $B$ for $\sigma$. The 250 observations in 
Plans 1 and 2 are divided into two groups and placed symmetrically on 
either side of the mean. In Plan 2, they are about as far apart as is 
required to minimize the variance of the estimate of $\sigma$; in Plan 1, they are 
brought closer together to get a more accurate estimate of $\mu$. Plan 3 spreads 
the observations evenly over the five classes spanned by Plan 2; it is symmetrical, 
and its largest distance is the largest that is feasible. Plans 4 and 5 are 
attempts to get some data for the short distances for comparison with the 
previous experiment. In Plan 4, the observations for the low distances 
are obtained by omitting those at $A$, since if $A$ is the true mean, such 
firings give no information about $\sigma$. In Plan 5, the observations for the 
low distances are taken equally from the five groups in Plan 3. 

To measure the relative efficiency of these five possible designs, 90 percent 
confidence limits on (5) were computed for each, assuming that the 
correct straight line was that obtained in the earlier experiment, 

\[ Y = -\frac{A}{B} + \frac{1}{B} d \tag{8} \]

The confidence limits for Plans 1 and 2 are plotted in Fig. 11.2, together 
with the confidence limits obtained from the earlier experiment (shown 
also in Fig. 11.1).* Table 11.2 shows these confidence limits and the 
assumed correct values in somewhat different form, namely, in terms of 
100 $T(d)$, the percentage of nondefective fuzes. 


TABLE 11.1 

Alternative Distributions of 250 Observations 


Distance 








1 

2 

3 

4 

5 

• 






A - *%B 




20 

20 

A - i %B 





30 

A - i%B 


125 

50 

■ 

40 

A — %B 

125 


50 


40 

A 



50 


40 

A +%B 

125 


50 


40 

A + 


125 

50 

50 

40 


Two points are fairly clear from Fig. 11.2 and Table 11.2: (1) any
of the five plans for 250 observations gives as much information as, if not
somewhat more than, the 1,211 observations used in the earlier experiment;
(2) the five possible plans do not differ substantially in efficiency.
Plan 1, which emphasizes most strongly the accuracy of the estimate of $\mu$,
therefore gives the narrowest confidence limits for distances near the
mean. Plan 2, which emphasizes most the accuracy of the estimate of $\sigma$,
gives the narrowest limits for distances far from the mean but the widest
limits for distances near the mean. The other plans are in general
intermediate.

* The confidence limits for Plans 3, 4, and 5 are omitted from Fig. 11.2 only
because they are so close to those for Plans 1 and 2 that if all five were plotted on a
figure the size of Fig. 11.2, it would be impossible to distinguish the five lines.

On the basis of this evidence, it was decided to recommend Plan 5.
This plan covers the largest number of values of d and therefore comes
closest to satisfying objectives other than the minimization of the
variance of estimates of $\mu$ and $\sigma$; at the same time, it does not sacrifice
much efficiency in the estimation of $\mu$ and of $\sigma$.


Fig. 11.2. Ninety percent confidence limits on assumed function for two
experimental designs utilizing 250 observations, and for preliminary
experiment based on 1,211 observations. [Plot not reproduced; axis label:
normal deviate.]


5. TECHNICAL NOTES* 

The problem of determining the optimum distribution of observations
for estimating $\mu$ and $\sigma$ is considerably simplified by the following two
requirements, which seem plausible and will be accepted here without
rigorous proof:

* This section presumes a knowledge of the methods of analyzing probit data (see
References, Sec. 6).

(1) The symmetry of the normal distribution implies that the optimum
distribution of observations will be symmetrical about the true
mean $\mu$. Note that replacing $d - \mu$ by its negative will have no effect
on variances, since these involve only (a) the weights, which depend only
on the absolute value of $(d - \mu)/\sigma$, and (b) the squares of quantities like
$d - \mu$.

(2) Since the problem is one of fitting a straight line containing two
parameters, the optimum distribution will involve placing all the
observations at two values of d. These two values of d enter into the variances
in the form of linear combinations of weighted squared deviations. If
one pair of values, $d_1$ and $d_2$, such that $d_1 - \mu = -(d_2 - \mu)$, contributes
more to the minimization of the variance than another pair of values, say
$d_3$ and $d_4$, such that $d_3 - \mu = -(d_4 - \mu)$, it will continue to do so no
matter how many observations are shifted from $d_3$ and $d_4$ to $d_1$ and $d_2$.

These two assumptions reduce the problem to one of determining a
pair of values $d_1$ and $d_2$ such that $d_1 - \mu = -(d_2 - \mu)$, at each of which
half the observations are to be placed.

Let $\mu$ = population mean
    $\sigma$ = population standard deviation
    $m$ = estimate of $\mu$
    $s$ = estimate of $\sigma$
    $Y_i = \dfrac{d_i - \mu}{\sigma}$ $(i = 1, 2)$
    $y_i = \dfrac{d_i - m}{s}$ = estimate of $Y_i$
    $T_i = \dfrac{1}{\sqrt{2\pi}} \int_{Y_i}^{\infty} e^{-x^2/2}\,dx$ = true proportion above $d_i$
    $t_i$ = estimate of $T_i$

Observed values of $t_i$ give, from tables of probits or of the normal
distribution, estimates of $Y_i$ and hence two equations

$$ y_1 = \frac{d_1 - m}{s} \tag{9} $$

$$ y_2 = \frac{d_2 - m}{s} \tag{10} $$

Solving for m and s gives

$$ m = \frac{y_1 d_2 - y_2 d_1}{y_1 - y_2} \tag{11} $$

$$ s = \frac{d_1 - d_2}{y_1 - y_2} \tag{12} $$

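It is known* that

$$ \sigma_{y_i}^2 = \frac{T_i(1 - T_i)}{n_i z_i^2} \tag{13} $$

where

    $n_i$ = number of observations at $d_i$
    $z_i = \dfrac{1}{\sqrt{2\pi}} e^{-Y_i^2/2}$

* See Bliss, “The Calculation of the Dosage-mortality Curve.”

To a first approximation,

$$ \sigma_m^2 = \left(\frac{\partial m}{\partial y_1}\right)^2_{y_i = Y_i} \sigma_{y_1}^2 + \left(\frac{\partial m}{\partial y_2}\right)^2_{y_i = Y_i} \sigma_{y_2}^2 \tag{14} $$

$$ \sigma_s^2 = \left(\frac{\partial s}{\partial y_1}\right)^2_{y_i = Y_i} \sigma_{y_1}^2 + \left(\frac{\partial s}{\partial y_2}\right)^2_{y_i = Y_i} \sigma_{y_2}^2 \tag{15} $$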

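The partial derivatives in (14) and (15) may be evaluated as follows:

$$ \frac{\partial m}{\partial y_1} = \frac{y_2(d_1 - d_2)}{(y_1 - y_2)^2} \tag{16} $$

$$ \frac{\partial m}{\partial y_2} = \frac{-y_1(d_1 - d_2)}{(y_1 - y_2)^2} \tag{17} $$

$$ \frac{\partial s}{\partial y_1} = \frac{-(d_1 - d_2)}{(y_1 - y_2)^2} \tag{18} $$

$$ \frac{\partial s}{\partial y_2} = \frac{d_1 - d_2}{(y_1 - y_2)^2} \tag{19} $$
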
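Hence

$$ \sigma_m^2 = \frac{Y_2^2(d_1 - d_2)^2}{(Y_1 - Y_2)^4} \cdot \frac{T_1(1 - T_1)}{n_1 z_1^2} + \frac{Y_1^2(d_1 - d_2)^2}{(Y_1 - Y_2)^4} \cdot \frac{T_2(1 - T_2)}{n_2 z_2^2} \tag{20} $$

$$ \sigma_s^2 = \frac{(d_1 - d_2)^2}{(Y_1 - Y_2)^4} \cdot \frac{T_1(1 - T_1)}{n_1 z_1^2} + \frac{(d_1 - d_2)^2}{(Y_1 - Y_2)^4} \cdot \frac{T_2(1 - T_2)}{n_2 z_2^2} \tag{21} $$

Substituting for $Y_1$ and $Y_2$ in (20) and (21) and simplifying gives

$$ \sigma_m^2 = \frac{\sigma^2}{(d_1 - d_2)^2} \left[ (d_2 - \mu)^2 \frac{T_1(1 - T_1)}{n_1 z_1^2} + (d_1 - \mu)^2 \frac{T_2(1 - T_2)}{n_2 z_2^2} \right] \tag{22} $$

$$ \sigma_s^2 = \frac{\sigma^4}{(d_1 - d_2)^2} \left[ \frac{T_1(1 - T_1)}{n_1 z_1^2} + \frac{T_2(1 - T_2)}{n_2 z_2^2} \right] \tag{23} $$
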

So far, no use has been made of the symmetry requirement. Equations
(22) and (23) are therefore valid for any pair of values of $d_1$ and $d_2$.
Symmetry implies that

$$ n_1 = n_2 \tag{24} $$

$$ (d_1 - \mu) = -(d_2 - \mu) \tag{25} $$

$$ T_1 = 1 - T_2 \tag{26} $$

$$ z_1 = z_2 \tag{27} $$

From (25),

$$ d_1 - d_2 = 2(d_1 - \mu) \tag{28} $$

Substituting into (22) and (23) gives

$$ \sigma_m^2 = \frac{\sigma^2 T_1(1 - T_1)}{2 n_1 z_1^2} \tag{29} $$

$$ \sigma_s^2 = \frac{\sigma^4 T_1(1 - T_1)}{2(d_1 - \mu)^2 n_1 z_1^2} = \frac{\sigma^2 T_1(1 - T_1)}{2 Y_1^2 n_1 z_1^2} \tag{30} $$

$\sigma_m^2$ will be a minimum if $T_1(1 - T_1)/z_1^2$ is a minimum with respect to
$d_1$ or, what is the same thing, with respect to $Y_1$. But $T_1(1 - T_1)/z_1^2$ is a
minimum at $Y_1 = 0$, as can be demonstrated by differentiating or, more
simply, by a glance at the tabled values of $z_1^2/[T_1(1 - T_1)]$.* Hence, to
minimize the variance of m, with $\sigma$ assumed known, it would be necessary
to put all the observations at $\mu$.

$\sigma_s^2$ will be a minimum if $T_1(1 - T_1)/(z_1^2 Y_1^2)$ is a minimum.
Differentiating yields an equation that would have to be solved numerically. Since,
as noted above, $z_1^2/[T_1(1 - T_1)]$ is tabled, it is simpler to compute directly
$Y_1^2 z_1^2/[T_1(1 - T_1)]$ and determine numerically the value of $Y_1$ at which it
reaches a maximum. This gives approximately 1.574 or 1.575 as the
value of $Y_1$ for which $\sigma_s^2$ is a minimum.† Hence, the variance of s will be
minimized by putting half the observations at $d_1 = \mu + 1.57\sigma$, and half
at $d_2 = \mu - 1.57\sigma$.
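
The two numerical statements above (a minimum at $Y_1 = 0$ for the
variance of m, and a maximum near 1.575 for the criterion governing the
variance of s) can be checked with a short computation. The following is
only a sketch: the function names, the grid step of 0.001, and the search
ranges are illustrative assumptions, and the normal tail area is obtained
from the standard library rather than from probit tables.

```python
import math

def normal_pdf(y):
    """z: ordinate of the standard normal curve at y."""
    return math.exp(-0.5 * y * y) / math.sqrt(2.0 * math.pi)

def upper_tail(y):
    """T: proportion of the standard normal distribution above y."""
    return 0.5 * math.erfc(y / math.sqrt(2.0))

def mean_criterion(y):
    """T(1 - T) / z^2, proportional to the variance of m in (29)."""
    t = upper_tail(y)
    return t * (1.0 - t) / normal_pdf(y) ** 2

def sd_criterion(y):
    """Y^2 z^2 / [T(1 - T)], inversely proportional to the variance of s in (30)."""
    t = upper_tail(y)
    return y * y * normal_pdf(y) ** 2 / (t * (1.0 - t))

# Grid search; the step and the ranges are arbitrary choices for illustration.
grid_mean = [i / 1000.0 for i in range(-3000, 3001)]
grid_sd = [i / 1000.0 for i in range(1, 3001)]
best_y_for_mean = min(grid_mean, key=mean_criterion)
best_y_for_sd = max(grid_sd, key=sd_criterion)

print(best_y_for_mean, best_y_for_sd)
```

The criterion for s is very flat near its maximum, which is presumably
why the text quotes "1.574 or 1.575" rather than a single value.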
CHAPTER 12


ABANDONING AN EXPERIMENT PRIOR TO COMPLETION 

1. INTRODUCTION 

Every experiment is designed in the light of certain presumptions
about the underlying phenomena to be studied, if only the presumption
that the phenomena warrant study. It is not always possible, however,
to confirm these presumptions in advance. In such cases it may be
desirable to introduce a sequential element into the design of the
experiment so that the experimental program can be adjusted to any facts
about the basic assumptions disclosed during the course of the
investigation. The present chapter considers a simple design of this type, where
an experiment is to be undertaken to estimate a parameter of a continuous
distribution, but provision is to be made to abandon the experiment
prior to completion if the accumulated evidence at any stage reveals that
the underlying population does not warrant study.

1.1. Nature of the Problem 

The circumstances giving rise to the need of an experiment that 
provides for possible abandonment are these: A new mechanism has been 
constructed whose performance, in functioning models, is thought to be 
superior to that of similar standard mechanisms. It is accordingly 
desired to determine the average and variability of a particular perform¬ 
ance characteristic from an experiment of sufficient size to give reliable 
estimates of these parameters. There is reason to fear, however, that the 
proportion of breakdowns may be prohibitively high, in which case the 
mechanism cannot be used in its present state of development and 
the estimation of its performance parameters is of little value. More¬ 
over, because the experiment will be costly in time, personnel, and equip¬ 
ment, considerable saving will result from discontinuing the experiment 
at an early stage if the mechanism exhibits too high a failure rate. An 
experiment is therefore desired that incorporates the following provisions: 
(1) If the failure rate exceeds a certain tolerance standard, there must 
be a preassigned degree of assurance of rejecting the mechanism and 
terminating the experiment; (2) if the failure rate is below a certain 
acceptable standard, there must be a preassigned degree of assurance of 
completing the experiment; (3) the experiment should cease at the 
earliest point at which the accumulated evidence is sufficient to reject 
the mechanism with the required degree of assurance. 

1.2. Given Features of the Experiment

Previous investigation of the mechanism appears to justify the 
assumption that the performance characteristic under study is normally 
distributed about some unknown average value. Moreover, from pro¬ 
visional figures it appears that a satisfactory balance between the cost of 
the experiment and the precision of the estimates can be obtained by a 
2 X 2 X 2 X 2 factorial experiment involving duplicate observations, 
that is, from 32 observations. Certain details of the experiment can thus 
be specified at once. If the mechanism has an acceptable failure rate, no 
fewer than 32 items can be tested, since 32 measurements of performance 
are required; on the other hand, no more than 32 functioning items need 
be examined, since these will yield the required number of measurements 
for estimation with the minimum amount of testing. Thus the criterion 
for accepting the mechanism can be represented graphically as a straight 
line by plotting the number of items that fail against the number of 
items tested. This acceptance line intersects the base line at 32, showing 
that no fewer than 32 items can be tested; it has a slope of 1, showing 
that if failures occur, the number of items tested shall be only sufficient 
to yield 32 performance measurements. As the experiment proceeds, a 
point can be plotted after each test representing the cumulated number 
of failures and the number of tests made; when such a point reaches the 
acceptance line the experiment is complete, for 32 performance measure¬ 
ments have been obtained. 

1.3. Statistical Problem 

If several hundred trials are made without obtaining the 32 perform¬ 
ance measurements needed to complete the experiment, it is obvious 
that the failure rate is excessive and the experiment ought to be aban¬ 
doned. This suggests the advisability of introducing a rejection line, 
showing, for each number of items tested the number of failures that 
would justify rejecting the mechanism and abandoning the experiment; 
the rejection line should intersect the vertical axis above 0 and have a 
slope less than 1. The two lines together yield a closed region such that, 
if the point representing the number of failures and the number of items 
tested crosses the upper line, the mechanism is rejected and the experi¬ 
ment abandoned; while if the point crosses the lower line, the mechanism 
is accepted and estimates are made from the resulting 32 measurements. 

Since the lower line is fixed by the conditions of the problem, the
statistical problem that remains is clearly to determine the upper line in
such a way as to meet preassigned requirements on the probabilities of
abandonment and completion, with a minimum average amount of testing.
No general solution to this problem has been found. The following
discussion will show, however, that sequential principles* can be used to
give a good approximation to the solution.

To formulate the problem more precisely, we assume that a proportion
of failures $p_2$ exists that is considered unacceptable; that is, if the failure
rate equals or exceeds $p_2$, we wish to reject the mechanism. If the failure
rate is less than $p_2$, however, we are willing to accept the mechanism; and
in particular, if it is less than some value $p_1$, we wish a high degree of
assurance of accepting the mechanism.


Fig. 12.1. [Chart not reproduced: number of failures (d) plotted against
number tested (N), with N = 32 marked on the horizontal axis.]

2. PROPOSED SOLUTION 

2.1. Approximating the Rejection Line by Sequential Principles 

The use of sequential principles to meet these requirements is suggested
by the desire to minimize the necessary amount of sampling. A
sequential criterion makes it possible to reappraise the failure rate of the
mechanism after each test and to determine on the basis of the accumulated
evidence whether (a) to reject the mechanism, (b) to accept it, or
(c) to continue the experiment. As soon as the accumulated evidence
is sufficient for decision (a) or (b), the experiment stops and further,
unnecessary experimenting is avoided. The usual sequential test, however,
consists of both an acceptance criterion and a rejection criterion,
whereas completion of the test under consideration requires only a
rejection criterion. What we seek, therefore, is a sequential test which
makes no provision for acceptance but which achieves certain assigned
probabilities of rejection for the specified failure rates $p_1$ and $p_2$. Having
found such a test, we can provide for actual acceptance by combining
our sequential rejection criterion with the predetermined acceptance
criterion, taking care to allow for the effect of this combination on the
total probabilities of rejection.

* See Chap. 17 for a brief account of sequential analysis.

The desired sequential rejection criterion can be derived as a special
case of the ordinary sequential binomial test. The ordinary test can be
represented on a chart similar to Fig. 12.1 by a pair of parallel straight
lines having a slope s and intercepts $-h_1$ and $h_2$. A sequential test which
provides only for rejection, that is, which consists solely of the upper
decision line, may be regarded as a limiting case of an ordinary sequential
binomial test with $h_1 = \infty$. This conception enables us to use the
formulas for the usual two-decision test to obtain the operating-characteristic
(OC) curve of a one-decision sequential test, that is, the probability
of acceptance $L_p$ as a function of the true population proportion of
failures p.

When $h_1 = \infty$, the OC curve is given by*

$$ p = \frac{R_p^{-s/h_2} - 1}{R_p^{-1/h_2} - 1} \qquad \text{when } p < s \tag{1} $$

and by

$$ R_p = 1 - L_p = 1 \qquad \text{when } p > s \tag{2} $$



Thus, the probability of rejecting is 1 if p > s; it lies between 0 and 1 if
p < s. Accordingly, we take the value $p_2$, representing the failure rate
above which we want to abandon the experiment, as the slope s of the
one-decision sequential test. Then if $\alpha$ is the risk we are willing to run
of abandoning the experiment and rejecting the mechanism when the
true failure rate is only $p_1$, we may substitute $p_1$ for p, $\alpha$ for $R_p$, and $p_2$
for s in equation (1) and solve for $h_2$.†
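
The substitution just described can be carried out numerically. The sketch
below assumes that equation (1) can be written as p = (x^s - 1)/(x - 1)
with R_p = x^(-h_2), which is the form used in the worked example of Sec. 3
(x = 2.25, h_2 = 2.84). The function names and the bisection tolerance are
illustrative choices, not part of the original text.

```python
import math

def solve_x(p, s, tol=1e-12):
    """Solve p = (x**s - 1) / (x - 1) for x > 1 by bisection (needs 0 < p < s < 1)."""
    if not 0.0 < p < s < 1.0:
        raise ValueError("requires 0 < p < s < 1")

    def g(x):
        return (x ** s - 1.0) / (x - 1.0)

    lo, hi = 1.0 + 1e-6, 2.0
    while g(hi) > p:              # g falls from about s toward 0 as x grows
        hi *= 2.0
    while hi - lo > tol * hi:
        mid = 0.5 * (lo + hi)
        if g(mid) > p:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

def rejection_intercept(p1, alpha, s):
    """Intercept h2 giving rejection probability alpha at failure rate p1."""
    x = solve_x(p1, s)
    return -math.log(alpha) / math.log(x)

x = solve_x(0.4, 0.5)
h2 = rejection_intercept(0.4, 0.10, 0.5)
print(round(x, 4), round(h2, 2))   # 2.25 2.84
```

With $p_1 = 0.4$, $\alpha = 0.10$, and s = 0.5 this reproduces the values
obtained by hand in Sec. 3.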

2.2. Adjusting the Rejection Line to Take Account of the Acceptance Line 

Having determined the one-decision sequential test that yields the
requisite probabilities of rejection, we now take into account the
acceptance criterion resulting from the conditions of the experiment. This
acceptance criterion affects the probabilities in two respects: (1) the
probabilities of rejection will be decreased, since those samples which
would lead to late rejection will not be rejected at all (in fact, will now be
accepted); (2) for every value of p < 1, whether or not p > s, the
probability of acceptance will be greater than zero. The net effect of
the acceptance line is thus to increase the chances of acceptance for all
values of p < 1. This means that $L_{p_2}$ (= $1 - R_{p_2}$) will now be positive,
while $R_{p_1}$ will be less than $\alpha$. The decrease in $R_{p_1}$ is, of course, desirable,
but it can be attained only at the cost of additional testing beyond that
necessary to maintain $R_{p_1}$ at the required level $\alpha$. It remains therefore
to readjust the decision lines so that $R_{p_1}$ will be $\alpha$ and $L_{p_2}$ will be at an
acceptably low level, say $\beta$.

* See Statistical Research Group, Sequential Analysis, Appendix B, equations
(B.80) and (B.81). (For full citations, see Sec. 5.)

† An approximation to $h_2$ can be obtained by replacing $p_2$ in the formula given in
Sequential Analysis, Sec. 2, equation 2.204, by $p_2 + (p_2 - p_1) = 2p_2 - p_1$.

Since the acceptance line is fixed by the conditions of the problem, the
necessary changes in the decision lines must be made entirely in the
rejection line. It is apparent that the probabilities of rejection can be
increased either by reducing $h_2$ or by reducing s, or both. It is not clear,
however, what combination of adjustments will give the desired risks
$\alpha$ and $\beta$ with a minimum amount of inspection. We therefore resort to
trial and error to find a combination of $h_2$ and s that gives the desired
probabilities of acceptance, leaving unsettled the question whether this
combination of $h_2$ and s is the best possible from the standpoint of the
average amount of testing. Fortunately, this trial-and-error process can
be carried out without too much difficulty, for a straightforward method
exists for computing the OC curve for any binomial test of the sort here
described. Thus, by determining the OC curve corresponding to each
of a number of combinations of $h_2$ and s, that combination can be chosen
which gives the requisite probabilities of acceptance at $p_1$ and $p_2$. This
procedure will yield a plan for the experiment which, though probably
not optimum from the standpoint of minimizing inspection, will serve as
an approximation to the type of plan desired, that is, lead to early
abandonment of the experiment if the true failure rate is high; moreover,
the OC curve will be available, so the consequences of using the plan will
be fully known.


3. AN EXAMPLE 

To illustrate the method of solution outlined in Sec. 2, let us suppose
that if as many as half the mechanisms are failures we wish a high degree
of assurance of abandoning the experiment and rejecting the proposed
model; if as few as 40 percent are failures we wish a high degree of
assurance of completing the experiment. Let the desired probability of
completing the experiment if the failure rate is as high as 50 percent be
$\beta = 0.10$, and the probability of abandoning the experiment if the failure
rate is as low as 40 percent be $\alpha = 0.10$.

The lower, or acceptance, line is determined by the requirements for
estimating the performance parameters, as explained above; its equation
is

$$ d_1 = -32 + N \tag{3} $$

where $d_1$ represents the cumulated number of failures and N represents
the number of tests made.

The first approximation to the rejection line is given by the one-decision
sequential test for which s = 0.5 and $h_2$ is obtained from

$$ 0.4 = \frac{0.1^{-0.5/h_2} - 1}{0.1^{-1/h_2} - 1} \tag{4} $$

Writing

$$ x = 0.1^{-1/h_2} \tag{5} $$

we have

$$ 0.4 = \frac{x^{0.5} - 1}{x - 1} = \frac{1}{x^{0.5} + 1} \tag{6} $$

$$ x^{0.5} = 1.5 \tag{7} $$

$$ x = 2.25 \tag{8} $$

Then, from (5),

$$ h_2 = \frac{-\log 0.1}{\log x} = \frac{1}{\log 2.25} = 2.84 \tag{9} $$


A moment's reflection on the orientation of the line $d_2 = 2.84 + 0.5N$
on the graph relating number of failures to number of items tested shows
that so long as s = 0.5, all values of h satisfying the inequality 2.5 < h < 3
are equivalent, since observed values of d and N must be integers. In
view of this, and because we expect this first approximation to yield too
large probabilities of acceptance, we place the initial value of $h_2$ at 2.5.
The first approximation to our test is thus defined by the two straight
lines with equations

$$ d_2 = 2.5 + 0.5N \quad \text{(rejection line)} \tag{10} $$

$$ d_1 = -32 + N \quad \text{(acceptance line)} \tag{11} $$
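
The remark that all intercepts between 2.5 and 3 are equivalent when
s = 0.5 can be checked by listing, for each N, the smallest whole number of
failures that reaches the rejection line. This is a sketch only: it assumes
that a point falling exactly on the line counts as a rejection (a convention
the text does not state outright), and the function name and the range of N
are illustrative.

```python
import math

def rejection_numbers(h, s, n_max):
    """Smallest integer d on or above the line d = h + s*N, for N = 1..n_max."""
    return [math.ceil(h + s * n) for n in range(1, n_max + 1)]

n_max = 60
reference = rejection_numbers(2.84, 0.5, n_max)

# Any intercept strictly between 2.5 and 3 gives the same rejection numbers.
same = all(rejection_numbers(h, 0.5, n_max) == reference for h in (2.6, 2.75, 2.99))

# An intercept of exactly 2.5 gives a stricter test under this convention.
differs = rejection_numbers(2.5, 0.5, n_max) != reference

print(same, differs)   # True True
```

Under the stated convention, moving the intercept from 2.84 down to 2.5
lowers the rejection number by one at every odd N, which is why it reduces
the probabilities of acceptance.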

However, this combination fails to give a satisfactory OC curve, the
values of $L_p$ being excessive throughout the range of p. On the other
hand, the combination $h_2 = 1.6$ and s = 0.4 gives values of $L_p$ that are
too small, and so also do the combinations $h_2 = 1.5$, s = 0.5 and $h_2 = 2.2$,
s = 0.4. A fifth set, $h_2 = 2.5$ and s = 0.4, gives the following OC
curve, which, though departing from that originally specified, appears
satisfactory.

    Proportion        Probability of
    of failures       completing experiment

       0.1                 1.000-
       0.2                 0.990
       0.3                 0.904
       0.4                 0.532
       0.5                 0.102
       0.6                 0.003


4. CALCULATION OF OC CURVE 

The method of calculating the OC curve for any particular experimental
plan of the type described is discussed at length elsewhere,* so
it will be outlined here only in general terms.

The basis of the method is the fact that in any truncated sequential
test, the number of possible outcomes in sampling is finite and, in practical
cases, fairly small. It is therefore possible to enumerate all possible
outcomes of the test and to determine the total probability of those outcomes
which lead to acceptance, that is, the value of $L_p$.

This calculation is accomplished step by step, as follows: For the first
stage of the experiment, all possible ways in which the sample might
turn out are specified, and the probability of each outcome is computed
on the assumption that the true failure rate of the mechanism is some
particular value, say $p_1$. These outcomes are then classified into two
groups, those which lead to acceptance and those which lead to the
decision to continue testing. The sum of the probabilities of the first
class gives the total probability of acceptance at the first stage of the
experiment. Each outcome in the second class is then considered as a
possible point of departure for the next stage of the experiment; and for
each such point of departure, the possible outcomes of the second stage
are considered and classified as before. The sum, over all points of
departure, of the probabilities of those outcomes which lead to acceptance
at the second stage then gives the total probability of acceptance at
stage two, while the sum of such probabilities over both stages gives the
total probability of acceptance for the first two stages. This basic
method of calculation can be carried through the remaining (finite
number of) stages of the experiment.

The total probability of acceptance taken over all stages represents
the probability that the experiment will lead to acceptance when the
true failure rate is $p_1$. By systematic methods these calculations can be
duplicated for a choice of different values of p, so that a sequence of
values of p and $L_p$ sufficient to determine the entire OC curve, or at least
those points of principal interest on the OC curve, can be determined. It
is in this fashion that the points on the OC curve in Sec. 3 were determined.
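
The stage-by-stage enumeration described above can be sketched in a few
lines, taking each single test as a stage. This is an illustration rather
than the procedure of the cited reference: the function name is hypothetical,
and the rule that the mechanism is rejected as soon as the number of failures
d satisfies d >= h_2 + sN is an assumed convention. The output may therefore
differ somewhat from the table in Sec. 3 if the original computation treated
boundary points or stages differently.

```python
def completion_probability(p, h2, s, needed=32):
    """Probability of obtaining `needed` measurements before rejection.

    Assumed rule: reject as soon as failures d satisfy d >= h2 + s * N
    after N tests; accept when N - d reaches `needed`.
    """
    if not 0.0 <= s < 1.0:
        raise ValueError("slope s must satisfy 0 <= s < 1")
    undecided = {0: 1.0}          # failures so far -> probability, no decision yet
    accepted = 0.0
    n = 0
    while undecided:
        n += 1
        next_stage = {}
        for d, prob in undecided.items():
            for d_new, q in ((d + 1, p), (d, 1.0 - p)):
                if q == 0.0:
                    continue
                if d_new >= h2 + s * n:
                    continue                      # rejected: experiment abandoned
                if n - d_new >= needed:
                    accepted += prob * q          # experiment completed
                else:
                    next_stage[d_new] = next_stage.get(d_new, 0.0) + prob * q
        undecided = next_stage
    return accepted

for p in (0.1, 0.2, 0.3, 0.4, 0.5, 0.6):
    print(p, round(completion_probability(p, h2=2.5, s=0.4), 3))
```

Because the rejection line has slope less than 1 and the acceptance line
has slope 1, the region of indecision is closed and the loop ends after a
finite number of tests.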

* Statistical Research Group, Sampling Inspection, Chap. 17, Sec. 2, pp. 187-202.

This method of attack yields only one of many possible tests of the
sort described, all having the same probabilities $L_{p_1} = 0.90$ and $L_{p_2} = 0.10$.
The optimum test may thus be viewed as the particular one of these
tests which requires the smallest average amount of inspection for the
relevant range of p. By sufficient calculation this test might be
determined by trial and error, since the probabilities used to determine the
OC curve of a test suffice to calculate the curve representing the average
amount of sampling as a function of the failure rate p (the ASN curve).*
If, therefore, we were to determine a whole class of tests with the same
$L_{p_1}$ and $L_{p_2}$, we could choose that one whose ASN curve showed the least
average cost in sampling in the relevant range of p. However, the
volume of calculation necessary to use this approach renders it impractical.

* For a method of calculating the ASN curve in such cases, see Statistical Research

CHAPTER 13

PLANNING EXPERIMENTS SEEKING MAXIMA

1. TYPE OF PROBLEM

This chapter deals almost entirely with a special class of experiments, 
namely, those designed to determine the conditions for which some func¬ 
tion attains a maximum. For example, experiments may be conducted 
on the breaking strength of a type of alloy to determine the composition 
producing maximum strength. It is not the primary object of such 
experiments to determine the functional relation between strength and 
composition, nor even to measure the strength, maximum or otherwise; 
the primary object is to find a particular combination of the independent 
variables at which the maximum of the dependent variable is nearly 
attained. Although the planning of experiments has become a major 
branch of statistics, little attention has been given to this important class 
of problem, which is especially common in engineering and development 
work.* 

For concreteness, this chapter considers the specific problem of 
planning a metallurgical experiment to find the best composition of a 
certain type of alloy for maximizing the time to rupture under a given 
stress at a given elevated temperature. Clearly, the principles elucidated 
in terms of this particular problem are applicable to a wide class of 
problems. 

2. PURPOSES OF EXPERIMENT 

The problem considered is one in which a single observable result 
(number of hours it takes a specimen to rupture) is a function of many 
independent variables (method of treating the alloy, chemical composi¬ 
tion of the alloy, and so forth). An experiment is to be conducted in 
which the result is to be measured for a series of values of the independent 
variables. 

Such an experiment is likely to have two somewhat different purposes:
(1) To find the values of the independent variables that make
the function (that is, the observable result) a maximum; (2) to evaluate
the function in the neighborhood of the maximum. At the present stage
of metallurgical knowledge, finding the maximum is usually the primary
purpose. There is reason to believe that nothing like the maximum
possible strength has yet been attained; and the theory of the subject
is so undeveloped as to make direct experiment the most fruitful approach
toward finding stronger alloys. Moreover, there is relatively little
interest in knowing the behavior of the function in regions rather distant
from the maximum.

* The pathbreaking work on the problem is that of Hotelling, “Experimental
Determination of the Maximum of a Function.” (For full citations, see Sec. 6.)

If the function has more than one maximum, the ultimate objective 
is to find the largest maximum—the absolute maximum. In general, 
however, this objective is unattainable without complete knowledge of 
the entire function. Such knowledge can seldom be secured experi¬ 
mentally, and almost certainly not in a single experiment. The dis¬ 
cussion that follows therefore assumes a more modest objective, the 
determination of a local maximum. 

Although the location of a maximum is of primary importance,
mapping the function in the neighborhood of a maximum is important
for several reasons:

(1) Typically, maximization of the particular dependent variable
measured in a given experiment is only one of several desiderata in
choosing an alloy, so it is important to know how much would be
sacrificed if the independent variables were altered to satisfy other
requirements. For example, the combination of independent variables that
gives the highest time to rupture may give insufficient ductility. It
might then be desirable to know how much strength would be sacrificed
by various changes designed to increase ductility.

(2) Knowledge of the relative importance of different independent 
variables is valuable in designing other experiments to find other maxima. 

(3) Knowledge of the function in the region of a maximum may 
contribute to an understanding of the mechanism at work and may thus 
advance theoretical knowledge. 

(4) It may not be possible to control the independent variables as 
closely in production as in laboratory experiments. Knowledge of the 
function in the neighborhood of a maximum will make it possible to 
judge the seriousness of departures from control. 

3. DEFICIENCIES OF FACTORIAL DESIGN 

The factorial design is ideally suited for experiments whose purpose 
is to map a function in a preassigned range of values of the independent 
variables.* The fact that the region is preassigned means that it is 

* The term “factorial design” is used in this chapter to refer to an experimental 
design in which a limited number of levels of each independent variable are selected 
in advance and at least one test is made for each combination of the selected levels of 
the independent variables. See Fisher, Design of Experiments, Chap. VI, The 
Factorial Design in Experimentation. 

The factorial design is in a sense the extreme example of a class of designs having 
in common the selection in advance of the values of the independent variables at which 
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feasible to lay out the design completely and specifically in advance 
of the experiment; the fact that the behavior of the function is desired 
for all possible combinations of the independent variables means that 
the experiment must represent all combinations in some way, which a 
factorial design does. 

For the purposes described in the preceding section, however, the 
factorial design has certain deficiencies: 

(1) It devotes observations to exploring regions that may turn out, 
in the light of the results, to be of no interest because they are far from a 
maximum. 

(2) Because it seeks to explore a given region comprehensively, it 

must either explore only a small region, or explore a large region very 
superficially. If a small region is explored, it may turn out to contain no 
lbcal maximum. The experiment would then at best indicate the direc¬ 
tion in which to move in conducting a new experiment. If a large 
region is explored, it may t >3 explored so superficially that a maximum 
may be missed entirely, or, if found, will be known so imperfectly that an 
additional experiment is required. * 

(3) The factorial design has the disadvantage, somewhat less funda¬ 
mental than the preceding, that it neglects the fact that some of the 
variables are continuous and some, though discrete, are ordered. This 
deficiency could, however, be remedied while retaining the basic prin¬ 
ciples of the factorial design. 

4. A SEQUENTIAL DESIGN 

These deficiencies of the factorial design suggest that an efficient 
design for the present purpose ought to be sequential; that is, ought to 
adjust the experimental program at each stage in the light of the results 
of prior stages. This section outlines a particular sequential design, 
primarily to illustrate the direction that seems desirable in the design of 
metallurgical experiments of the type here considered. 

Step 1. Select some particular combination of independent variables 
as a starting point. The combination selected should be the best advance 
estimate of the optimum combination. 


observations are to be made. Other designs in this class may, in any particular 
instance, be more economical than a complete factorial and preferable to it. Economy 
can be gained without loss, however, only if additional knowledge is available in 
advance (for example, knowledge that certain interactions are negligible). 

Many of the remarks in thip chapter are applicable to these other designs as well 
as to the complete factorial. However, the chapter considers explicitly only the 
complete factorial design in order to have a sharp and definite contrast and to avoid 
complications and qualifications that have little bearing on the main issue raised in 
this chapter. 
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Step 2. Arrange the independent variables in some order. 

Step 3. Make tests at the original starting point and at a series of 
points differing only in the value assigned the first independent variable, 
the other variables being held constant.* From this series of experi¬ 
ments, estimate the value of the first independent variable that maximizes 
time to rupture.† If the independent variable is continuous, it will 
probably be best to find the maximum by some such device as fitting a 
polynomial to the observations. 

Step 4. Holding the first independent variable fixed at the maximiz¬ 
ing value found in step 3 and the other independent variables, except the 
second, at the values originally assigned to them, repeat step 3 with the 
second independent variable. Repeat this process with the third, fourth, 
. . . , independent variable. 

Step 5. After all variables have been varied (call this a “round” of 
experiments), the whole process can be repeated, except that the original 
starting point is replaced by the combination of values of the independent 
variables reached at the end of the first round. 
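
The five steps can be sketched in code. What follows is only an illustration under stated assumptions, not a procedure given in the text: the response function `rupture_time`, the grids of levels, and the starting point are hypothetical; experimental error is ignored; and the maximizing level at each step is taken to be the best level actually tried rather than the maximum of a fitted polynomial.

```python
from typing import Callable, List, Sequence, Tuple


def sequential_round(
    response: Callable[[Sequence[float]], float],
    start: Sequence[float],
    grids: Sequence[Sequence[float]],
) -> Tuple[List[float], int]:
    """One round: vary each variable in turn, holding the others fixed."""
    point = list(start)
    n_obs = 0
    for i, levels in enumerate(grids):          # Steps 3 and 4
        best_level, best_y = point[i], None
        for level in levels:
            trial = point[:i] + [level] + point[i + 1:]
            y = response(trial)                 # one observation
            n_obs += 1
            if best_y is None or y > best_y:
                best_level, best_y = level, y
        point[i] = best_level                   # fix the variable at its best level
    return point, n_obs


def sequential_design(response, start, grids, rounds: int):
    """Step 5: repeat rounds, each starting where the previous one ended."""
    point, total = list(start), 0
    for _ in range(rounds):
        point, n_obs = sequential_round(response, point, grids)
        total += n_obs
    return point, total


def rupture_time(x: Sequence[float]) -> float:
    """Hypothetical noise-free response with an interaction; maximum at (2, 3)."""
    a, b = x[0] - 2.0, x[1] - 3.0
    return 100.0 - (a * a + b * b + 0.5 * a * b)


# Hypothetical levels: six values of each of two variables.
GRIDS = [[0, 1, 2, 3, 4, 5], [0, 1, 2, 3, 4, 5]]
```

With this made-up response, the first round moves from (0, 0) to (3, 3) and the second reaches the maximum at (2, 3), using 24 observations in all against 36 for the complete 6 × 6 factorial. Because of the interaction term, one round is not enough.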

Successive rounds will clearly converge to the neighborhood of a local 
maximum, with perhaps a few “pathological” exceptions; for, aside from 
statistical fluctuations, each round and each step within a round 
necessarily represent an improvement. The rate at which the improvements 
decline from round to round is likely to give some idea of how much 
further progress is possible and some indication of the value of additional 
rounds. The process of convergence may perhaps be speeded up after 
one or more complete rounds. For example, after the second complete 

* For the present purpose the precise meaning to be attached to “the other vari¬ 
ables being held constant ” is irrelevant. For variables representing chemical 
composition, it can be interpreted as meaning that the relative proportions of the other 
variables are held constant. Alternatively, for all but one variable, it can be inter¬ 
preted as meaning that the percentage a particular element forms of the total is held 
constant. Any combination of these two will also do, provided the method of holding 
other variables constant is followed consistently. 

† If the independent variable is ordered, this process also could be sequentialized, 
with obvious advantages. For example, tests could be made (a) at the original start¬ 
ing point and (b) at a single other value of the first independent variable. If the 
tests at (b) gave a (significantly) higher time to rupture than the tests at (a), other 
tests would be made by moving still farther in the same direction from the original 
starting point. If the tests at (b) gave a shorter time to rupture, the next tests would 
be made by moving in the opposite direction from the starting point. This process 
would be continued until a value was found higher than its neighbors in both direc¬ 
tions. This aspect of a sequential procedure is not considered in detail here because 
it is not germane to our main point and because practical considerations having to do 
with full use of experimental equipment may make it undesirable to sequentialize the 
experiment fully. Even though this phase of the experiment is not sequentialized, 
there is a real problem about designing it efficiently. See Hotelling, “Experimental 
Determination of the Maximum of a Function.” 
round it might be desirable to move along the vector defined by the end 
points of the first two rounds, instead of varying the variables according 
to the original pattern. After one such step, the original pattern could 
be used again. 

It will clearly be desirable in the early rounds to use rather broad 
intervals in varying each independent variable and to locate the maximum 
only very approximately. Finer and finer intervals should be used in 
later rounds as the maximum is approached. A consequence of this is 
that the process will provide only a very rough mapping of the function 
in regions far removed from a maximum, but a relatively fine mapping in 
regions near a maximum. The mapping of the region in the neighborhood 
of a maximum can, if desired, be filled in by a supplementary 
experiment, perhaps of factorial design, after the neighborhood of the 
maximum has been delimited with sufficient accuracy. 

It may be worth repeating that nothing more is claimed for this outline 
of a sequential design than that it is suggestive. Numerous problems 
would arise in applying it to an actual experiment, and there are 
many possibilities of increasing its efficiency. The suggested sequential 
design is in large part simply an abstract and systematic statement of a 
rather natural, obvious, and common method of locating a maximum or 
minimum of a function of many variables. 

5. COMPARISON OF FACTORIAL AND SEQUENTIAL DESIGNS 
5.1. Number of Observations 

The great advantage of a sequential design over a factorial design is 
that it will concentrate observations in the neighborhood of a maximum. 
This advantage is likely to be greater, the larger the number of inde¬ 
pendent variables and the larger the number of values of each independent 
variable at which observations are taken. Let n be the number of 
variables and suppose that observations are to be taken at k values of 
each variable (for the sequential design, these need not be the same 
values in successive rounds). Then a complete factorial design without 
replication will require $k^n$ observations. A single round of the sequential 
design without replication will require $kn$ observations. The observations 
required for a factorial design will therefore permit $k^n/kn = k^{n-1}/n$ 
rounds, which increases with n and with k. The number of rounds 
required under the sequential design will decrease as k increases, since the 
maximum at each step would be determined more accurately. We do not 
know how the number of rounds required under the sequential design 
varies with n—nor indeed is it easy to see how to state this question 
precisely. We expect, however, that the number of rounds required will 
not depend very sharply on n but will depend primarily on (a) the closeness 
of the original starting point to the optimum combination and (b) 
the size of the experimental error relative to the differences it is considered 
important to detect. 

Consider a simple 5 × 3 × 3 × 3 factorial design, that is, one involving 
four independent variables, with five values of the first and three 
values of each of the others. Such a design requires 135 observations, 
without providing for replications. A single round of the sequential 
design, covering the same number of values of each independent variable, 
would require 14 observations. It would therefore be possible to make 
almost 10 complete rounds with the number of observations required for 
the factorial experiment. Fewer than 10 rounds should suffice if the 
original starting point is reasonably close to the maximum. More rounds 
might be required if the original starting point were far from a maximum; 
but if so little information about the location of a maximum were avail¬ 
able, a factorial experiment might well prove fruitless. 

As another example, consider an experiment to determine the optimum 
composition of the high-carbon vitallium-type alloys with respect 
to elements that have been included in various tests of this type of 
alloys. At least ten elements have been included in one or another trial 
composition. This gives nine independent variables, since the sum of the 
percentages must add up to 100. Suppose we are willing to use only 
three levels of each element. A complete factorial design without 
replications would then require $3^9$, or 19,683, observations. A single 
round of the sequential design without replications would require only 
27 observations, so that it would be possible to make 729 rounds with 
the number of observations required for a complete factorial design. 
Five rounds, with duplicate observations at each level on each round, 
would require only 270 observations, or less than 1.5 per cent of the 
number required for a complete factorial design without replications. 
As this example indicates, the advantage of the sequential design in 
terms of number of observations is likely to become extremely great as the 
number of variables increases. And, it should be noted, the metal¬ 
lurgical problem in question does in fact involve many variables. The 
example cited is certainly conservative, since, in addition to the ten 
chemical elements, the strength of the alloy is likely to be affected by the 
solution and aging treatments, the methods of casting or of forging, and 
so forth. 
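
The arithmetic of the two examples above can be checked with a short sketch. The function names are illustrative only and are not taken from the text; the counts assume no replication unless stated, and each variable may have its own number of levels.

```python
from math import prod
from typing import Sequence


def factorial_observations(levels: Sequence[int]) -> int:
    """Observations in a complete factorial: product of the numbers of levels."""
    return prod(levels)


def round_observations(levels: Sequence[int]) -> int:
    """Observations in one round of the sequential design: sum of the levels."""
    return sum(levels)


def rounds_affordable(levels: Sequence[int]) -> float:
    """Rounds that fit within the factorial's budget of observations."""
    return factorial_observations(levels) / round_observations(levels)


four_variables = [5, 3, 3, 3]       # the 5 x 3 x 3 x 3 example
nine_variables = [3] * 9            # nine composition variables, three levels each

print(factorial_observations(four_variables), round_observations(four_variables))
print(rounds_affordable(four_variables))        # a little under 10 rounds
print(factorial_observations(nine_variables), rounds_affordable(nine_variables))

# Five rounds with duplicate observations at each level on each round.
duplicated = 5 * 2 * round_observations(nine_variables)
print(duplicated, duplicated / factorial_observations(nine_variables))
```

When every variable has the same number of levels k, the ratio reduces to $k^{n-1}/n$, the expression given earlier in this section.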

It should be recognized that the additional observations obtained 
in the factorial experiment are not useless. They provide information 
about the behavior of the function in regions removed from a maximum, 
about the relative importance of the variables, and about their inter¬ 
actions. This information is likely to be valuable in choosing new 
regions within which to search for maxima and in deciding on the variables 
on which to concentrate in preliminary experimentation. More 
important still, the observations some distance from a maximum may, 
under some circumstances, tell something about the behavior of time to 
rupture in the neighborhood of that maximum; perhaps the most com¬ 
mon circumstances when this occurs are when (a) interactions are small, 
or ( b ) there is available a fairly good approximation to the mathematical 
form of the function, involving relatively few parameters. 

With respect to proviso (a), work on metallurgical problems suggests 
that interactions rarely are small. Furthermore, while the efficiency 
of a factorial design for finding a maximum is greater if there are no 
interactions than if there are interactions, so is that of a sequential 
design. If there are no interactions, a single round of the sequential 
experiment will, except for statistical fluctuations, lead directly to the 
maximum. Statistical fluctuations mean that successive rounds will not 
lead to exactly the same point; but they will tend to give much the same 
time to rupture, and it should become clear fairly quickly that one is in 
the immediate neighborhood of a maximum. Moreover, in the absence 
of interactions, data from successive rounds can be combined to give a 
better estimate of the maximum. 

Proviso (b), that there be available a good approximation, involving 
relatively few parameters, to the mathematical form of the function, is 
rarely satisfied. But if it is, neither the standard factorial design nor the 
sequential design suggested in Sec. 4 is appropriate. The problem then 
is to determine as accurately as possible the values of the independent 
variables that maximize this function. These maximizing values depend 
upon the parameters of the function; the optimum design for this pur¬ 
pose thus depends on the form of the function and on the approxi¬ 
mate values of its parameters. Probably the best plan is to conduct a 
small preliminary experiment to get a general idea of the values of the 
parameters and to use this information in designing a larger experiment.* 

5.2. Time Trends 

One real advantage of the factorial design is that it is possible, through 
proper randomization, to prevent the introduction of bias from any time 
trends that may be present. Time trends will, indeed, reduce the power 
of the factorial experiment to detect the effects that it is designed to 
measure, but they will not bias the conclusions. The sequential plan, on 
the other hand, is very much at the mercy of time trends; and great care 
would have to be taken to see that none crept in. Measures that might 
be adopted would include (a) running control specimens of constant 
composition along with the test specimens; (b) keeping a running record 


* See Hotelling, “Experimental Determination of the Maximum of a Function.” 
of differences between duplicate specimens to detect changes in experi¬ 
mental error. 
CHAPTER 14 


PROBABILITY THAT SAMPLE MEANS ARE IN OPPOSITE ORDER 
TO POPULATION MEANS 

1. INTRODUCTION 

When N objects are drawn independently at random from each of 
two populations in which some measurable characteristic x of the objects 
is distributed about mean values $m_1$ and $m_2$, what is the probability that 
the larger of the two sample means $\bar{x}_1$ and $\bar{x}_2$ will correspond with the 
population having the smaller mean? In other words, if a decision 
about the sign of $m_1 - m_2$ is based on the sign of $\bar{x}_1 - \bar{x}_2$, what is the 
probability that the decision will be wrong? 

Such a question arises if two alternative sources $S_1$ and $S_2$ (for example, 
processes for making, procedures for treating, genetic strains) yield 
objects of a certain kind, and the worth of each object is determined by 
the value of some measurable characteristic x that varies about an average 
$m_1$ for objects from $S_1$ and about an average $m_2$ for objects from $S_2$. 
If $m_1$ and $m_2$ were known, a decision to adopt $S_1$ or $S_2$ would be determined 
(completely) by the sign of $m_1 - m_2$; that is, $S_1$ or $S_2$ would be 
chosen according as $m_1$ exceeds or does not exceed $m_2$. When $m_1$ and $m_2$ 
are unknown, it is natural to base the decision on the sign of $\bar{x}_1 - \bar{x}_2$, 
where $\bar{x}_1$ and $\bar{x}_2$ are the average values of x for N objects drawn independently 
at random from $S_1$ and $S_2$, respectively. Such decisions will 
be wrong, that is, opposite to the decision that would be made if $m_1$ and 
$m_2$ were known, whenever the sign of $\bar{x}_1 - \bar{x}_2$ is opposite to that of 
$m_1 - m_2$. Consequently, it is of interest to know the probability of such 
an occurrence and how this probability depends on the sample size N 
and on the absolute magnitude of the true difference, $|m_1 - m_2|$. 

In Sec. 2 the probability of $\bar{x}_1 - \bar{x}_2$ having its sign opposite to that 
of $m_1 - m_2$ is evaluated, and its dependence on N and $|m_1 - m_2|$ studied, 
for the case where x is normally distributed about $m_1$ with a standard 
deviation of $\sigma_1$ for objects from $S_1$, or about $m_2$ with standard deviation 
$\sigma_2$ for objects from $S_2$. In Sec. 3 attention is given to the question of 
how large N should be in order that the risk of a wrong decision be less 
than $\beta$ whenever $|m_2 - m_1|$ exceeds D, say. The results given in Secs. 2 
and 3 are applicable, to a good approximation, in cases where x is non-normally 
distributed for objects from $S_1$ or $S_2$, provided the distributions 
of x have finite standard deviations, which is generally the case in practical 
work, and N is sufficiently large. For sample sizes of 4 or larger 
the approximation will generally be adequate for most practical purposes 
when $\sigma_1$ and $\sigma_2$ are known. When $\sigma_1$ and $\sigma_2$ are unknown and N is not 
less than 30, $\sigma_1$ and $\sigma_2$ may be replaced by their estimates from the 
samples 


$$s_i = \sqrt{\frac{N \sum_{j=1}^{N} x_{ij}^2 - \left(\sum_{j=1}^{N} x_{ij}\right)^2}{N(N - 1)}} \qquad (i = 1, 2) \tag{1}$$


and the results of Secs. 2 and 3 will not be seriously affected. In Sec. 4, 
the conditions under which the sign of $\bar{x}_1 - \bar{x}_2$ provides the “best” 
decision function for choosing between $S_1$ and $S_2$ are stated, and the 
problem of choosing a best decision function in other circumstances is 
discussed briefly. 


2. PROBABILITY FUNCTION 

Let $\bar{x}_1$ and $\bar{x}_2$ denote the means of N observations drawn independently 
at random from each of two normal populations with means $m_1$ and $m_2$ 
and standard deviations $\sigma_1$ and $\sigma_2$, respectively. Then in repeated 
sampling $\bar{x}_1 - \bar{x}_2$ will be normally distributed about a mean $m_1 - m_2$ 

TABLE 14.1 

Probability That Sample Means Will Be in Opposite Order to 
Population Means for Selected Values of $\delta$ and N 

Absolute difference          Number of observations 
between population           in each sample (N) 
means in $\sigma$ units*            N = 4 

0.00                         0.500 
0.05                         0.472 
0.10                         0.444 
0.15                         0.416 
0.20                         0.389 
0.25                         0.362 
0.30                         0.336 
0.40                         0.286 
0.50                         0.240 
0.75                         0.144 
1.00                         0.079 
1.25                         0.039 
1.50                         0.017 
1.75                         0.007 
2.00                         0.002 
2.50                         0.0002 
3.00                         0.0000 



* Let $m_1$ and $\sigma_1$ denote the mean and standard deviation of the first population, $m_2$ and $\sigma_2$ denote the 
mean and standard deviation of the second population, and let $\sigma^2 = (\sigma_1^2 + \sigma_2^2)/2$; then $\delta = |m_1 - m_2|/\sigma$. 
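with standard deviation $\sigma\sqrt{2/N}$, where 

$$\sigma^2 = \frac{\sigma_1^2 + \sigma_2^2}{2} \tag{2}$$

Consequently, the probability that the sign of $\bar{x}_1 - \bar{x}_2$ will be opposite 
to that of $m_1 - m_2$ is 

$$P(\delta, N) = \frac{1}{\sqrt{2\pi}} \int_a^\infty e^{-t^2/2}\,dt \tag{3}$$

where 

$$\delta = \frac{|m_1 - m_2|}{\sigma} \tag{4}$$

and 

$$a = \delta\sqrt{\frac{N}{2}} \tag{5}$$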


$P(\delta, N)$ is tabulated to 3 decimals (occasionally 4) in Table 14.1 for 
$\delta$ = 0 to 0.30 by steps of 0.05, 0.30 to 0.50 by steps of 0.10, 0.50 to 2.00 
by steps of 0.25, 2.00 to 3.00 by steps of 0.50; and for N = 2 to 12 by 


FIG. 14.1. Probability that means of samples from two normal populations will be in 
opposite order to population means. (Vertical axis: probability.) 
steps of 2, 16, 20, 50, 100, 200. For example, P(0.50,10) = 0.132, which 
signifies that, with samples of 10 observations from each of two populations 
whose means differ by $\sigma/2$, the probability of the sample means 
being in opposite order to the population means is 0.132. This event 
may be expected to occur, therefore, more than 1 time out of 8 in the 
long run, under such circumstances. Clearly it is risky to base decisions 
on the sign of $\bar{x}_1 - \bar{x}_2$ for samples of only 10 observations each when 
differences of $\sigma/2$, or less, between population means are of practical 
importance. 

In order to portray better the dependence of $P(\delta, N)$ on $\delta$ and on N, 
$P(\delta, N)$ has been plotted against $\delta$ in Fig. 14.1 for N = 2, 4, 10, 20, 50, 
100, and 200. From these curves it is clear that decisions based on the 
sign of $\bar{x}_1 - \bar{x}_2$ will rarely be incorrect when $|m_1 - m_2|$ exceeds $2\sigma$ ($\delta = 2$), 
even for samples of only 4 observations each. On the other hand, if an 
incorrect decision is of practical importance when $|m_1 - m_2| = \sigma/2$, and 
if it is desired that the probability of an incorrect decision not exceed 
0.01 under these circumstances, then about 50 observations from each 
population will be needed as a basis for decisions. 
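
The probability function of this section is the upper tail of the standard normal distribution beyond $\delta\sqrt{N/2}$, so it can be evaluated directly. The sketch below is offered as an aid to checking the tabled values, not as the method by which the table was computed; the function name is an assumption, and the tail area is obtained from the complementary error function in Python's standard library.

```python
from math import erfc, sqrt


def prob_opposite_order(delta: float, n: int) -> float:
    """P(delta, N): chance that the sample means are in the opposite order.

    delta is |m1 - m2| / sigma, with sigma^2 the average of the two variances;
    n is the number of observations in each sample.
    """
    a = delta * sqrt(n / 2.0)
    # Upper-tail area of the standard normal beyond a.
    return 0.5 * erfc(a / sqrt(2.0))


print(round(prob_opposite_order(0.50, 10), 3))   # the worked example: 0.132
print(round(prob_opposite_order(2.00, 4), 3))    # large difference, small samples
print(prob_opposite_order(0.50, 50) < 0.01)      # about 50 observations suffice
```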

3. SAMPLE SIZES 


If we choose a probability $\beta$ of making an incorrect decision (a 
decision based on the sign of $\bar{x}_1 - \bar{x}_2$ different from that which would be 
made if the sign of $m_1 - m_2$ were known) and define $K_\beta$ by 

$$\frac{1}{\sqrt{2\pi}} \int_{K_\beta}^\infty e^{-t^2/2}\,dt = \beta \tag{6}$$

then, from (3), (4), and (5) it follows that 

$$\frac{|m_1 - m_2|}{\sigma}\sqrt{\frac{N}{2}} = K_\beta \tag{7}$$

determines, for a given N and $\sigma$, the value of $|m_1 - m_2|$ for which the 
probability of an incorrect decision is $\beta$; or, for a given value of $|m_1 - m_2|/\sigma$, 
the value of N that will be needed if the probability of an incorrect 
decision is to be $\beta$. Values of $K_\beta$ can be obtained from Table 1.1 of 
Chap. 1, or values of P corresponding with various values of K from 
Table 1.5. 

For example, if an incorrect decision is serious when $|m_1 - m_2| = \sigma/2$, 
and if it is desired to limit the risk of such an error to 0.01 under these 
circumstances, then (7) implies that at least 44 ($8 \times 2.3263^2 = 43.3$) 
observations will be needed from each of the populations. On the other 
hand, if the sampling has already been carried out, taking only 30 
observations from each population, it may be some consolation to 
know that the risk of an incorrect decision is only 0.05 when 

$$|m_1 - m_2| = 1.6449\,\sigma/\sqrt{15} = 0.42\sigma$$

and is 0.01 when 

$$|m_1 - m_2| = 2.3263\,\sigma/\sqrt{15} = 0.60\sigma.$$
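
Relation (7) can be solved either for N or for the difference between the population means. The following sketch does both, under the assumption that the normal deviate $K_\beta$ is taken from the standard library's `statistics.NormalDist` rather than from Table 1.1; the function names are illustrative.

```python
from math import ceil, sqrt
from statistics import NormalDist


def k_beta(beta: float) -> float:
    """Normal deviate exceeded with probability beta."""
    return NormalDist().inv_cdf(1.0 - beta)


def required_n(beta: float, delta: float) -> int:
    """Smallest N per sample keeping the risk at or below beta when
    |m1 - m2| / sigma equals delta, from (7): N = 2 (K_beta / delta)^2."""
    return ceil(2.0 * (k_beta(beta) / delta) ** 2)


def difference_at_risk(beta: float, n: int) -> float:
    """Value of |m1 - m2| / sigma at which the risk equals beta for given N."""
    return k_beta(beta) / sqrt(n / 2.0)


print(required_n(0.01, 0.5))                    # 44, as in the example
print(round(difference_at_risk(0.05, 30), 2))   # 0.42
print(round(difference_at_risk(0.01, 30), 2))   # 0.6
```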

4. CONDITIONS UNDER WHICH THE SIGN OF $\bar{x}_1 - \bar{x}_2$ PROVIDES THE BEST 
DECISION FUNCTION 

In Sec. 1 it was stated that if a decision to choose $S_1$ or $S_2$ would be 
determined (completely) by the sign of $m_1 - m_2$ if this sign were known, 
it is natural to base the decision on the sign of $\bar{x}_1 - \bar{x}_2$. This raises the 
question of whether the sign of $\bar{x}_1 - \bar{x}_2$ is actually the best function of the 
data upon which to base the decision. 

H. A. Simon has shown that when (a) x is normally distributed in 
each of the populations $S_1$ and $S_2$, (b) the standard deviation of x is the 
same ($\sigma_1 = \sigma_2$) in $S_1$ and $S_2$, (c) there is no a priori basis for a choice 
between $S_1$ and $S_2$ without knowledge of the values of $m_1$ and $m_2$ (or, at 
least, of the sign of $m_1 - m_2$), and (d) the choice between $S_1$ and $S_2$ is to 
be based on some symmetric function of the observed values of an equal 
number N of observations drawn independently at random from $S_1$ and 
$S_2$, then the probability of an incorrect decision will be a minimum, whatever 
the values of $m_1$ and $m_2$ and of the common value of $\sigma_1$ and $\sigma_2$, if the 
decision to choose $S_1$ or $S_2$ is based on the sign of $\bar{x}_1 - \bar{x}_2$. As we have 
seen above, the probability of an incorrect decision, when the sign of 
$\bar{x}_1 - \bar{x}_2$ provides the basis for decision, is determined completely by N, 
$\sigma = \sigma_1 = \sigma_2$, and $|m_1 - m_2|$ under these conditions. 

When conditions (a), (c), and (d) above are fulfilled, but $\sigma_1 \neq \sigma_2$, 
then, Simon has found, there does not exist any symmetric function of the 
2N observations having the property that when the decision to choose 
between $S_1$ and $S_2$ is based on the observed value of this function the 
probability of an incorrect decision will be a minimum whatever the 
values of $m_1$ and $m_2$; this is true even when the values of $\sigma_1$ and $\sigma_2$ are 
known. This is the case a fortiori when the value of either $\sigma_1$ or $\sigma_2$ is 
unknown. He has found, however, that if it is agreed that (d') the choice 
between $S_1$ and $S_2$ is to be based on some symmetric function of the 
differences $x_{11} - x_{21}$, $x_{12} - x_{22}$, . . . , $x_{1N} - x_{2N}$, then the sign of 
$\bar{x}_1 - \bar{x}_2$ is the decision function that minimizes the probability of error, 
whatever the values of $m_1$ and $m_2$ and of $\sigma_1$ and $\sigma_2$. 

When x is non-normally distributed in one or both of the populations, 
then $\bar{x}_1 - \bar{x}_2$ no longer furnishes the most efficient estimate of $m_1 - m_2$ 
and the sign of $\bar{x}_1 - \bar{x}_2$ will not be the best decision function upon which 
to base a choice between $S_1$ and $S_2$, even when $\sigma_1 = \sigma_2$. Welch has discussed 
the determination of the best discriminant function upon which 
to base a choice between two populations when the form of the distribution 
of x in each of the populations is known. 
CHAPTER 15 


SIGNIFICANCE OF THE LARGEST OF A SET OF 
SAMPLE ESTIMATES OF VARIANCE 

1. TYPE OF PROBLEM 

When data relate to several different processes, to several different 
products, to a single process at different times, or to a particular type of 
product from each of several different sources, the analyst is frequently 
faced with the necessity of deciding whether the observed differences in 
variability of the data within the respective classes are consistent with 
the supposition that the intrinsic variability is uniform, that is, the same 
for each class. 

If the classes are not uniform in variability, then simple comparisons 
of the classes in other respects may be misleading. Thus, if the within- 
class variability differs greatly from class to class, the mean values for 
the respective classes of the variable under consideration no longer pro¬ 
vide an adequate basis for comparison, since a small mean associated with 
a large dispersion may imply greater frequency of occurrence of undesir¬ 
able values of the characteristic than will a larger mean associated with a 
smaller dispersion of individual values. In such cases it may be more 
useful to compare the classes with respect to the probability of occurrence 
of an individual value greater than a specified upper limit U (as in 
Chap. 1) or with respect to some other feature that takes into account 
both the average value and the variability of the characteristic in each 
class. In this connection it should be recalled that the customary F test 
of the homogeneity of class means, which consists of comparing the 
observed ratio of the between-class mean square to the pooled within-class 
mean square, is valid only when the intrinsic within-class variability is 
the same for each class. Indeed, a pooled estimate of within-class 
variability, obtained by combining measures of the respective within- 
class dispersions, may be useless for practical purposes, unless there is 
homogeneity of within-class variability or the method of combining the 
within-class dispersions takes proper account of the relative importance 
(for example, the relative frequency of occurrence) of the respective 
classes under realistic conditions. 

Of the various criteria that have been advanced for testing the homo¬ 
geneity of a set of variance estimates, the test proposed by Bartlett is 
generally considered to be the best one available of the “omnibus” 
variety.* An omnibus test is one that has good discriminating power 
with respect to a large variety of alternatives to the null hypothesis. 
The customary F test of the homogeneity of a set of means is often used 
as an omnibus test. This is the case when there are k means and the 
between-means mean square having k — 1 degrees of freedom is com¬ 
pared with the error mean square. But “the statistician who does not 
know in advance with which type of alternative to [homogeneity] he may 
be faced is in the position of a carpenter who is summoned to a house to 
undertake a job of an unknown kind and is only able to take one tool 
with him! Which shall it be? Even if there is an ‘omnibus’ tool, it is 
likely to be far less sensitive at any particular job than a specialized one; 
but the specialized tool will be quite useless under the wrong conditions.”† 
It is therefore advisable in applications of the F test to isolate and test 
individually the mean squares corresponding to comparisons that have 
special interest a priori, or that represent comparisons between rational 
subgroups of the data that may have unequal (population) means for 
various a priori reasons. By thus adapting the F test to the circum¬ 
stances of the study involved, its discriminating power is improved, at 
least with respect to the alternatives to homogeneity that are believed 
to be the most likely to exist or to be of special importance if they do 
exist. Bartlett has pointed out that his over-all test for homogeneity of 
a set of variances can be separated, in similar fashion, into parts represent¬ 
ing comparisons of special interest between subsets of variance estimates, 
and the statistical significance of each part tested individually.‡ Like 
the F test for homogeneity of means, Bartlett’s test has been shown to be 
unbiased and to possess various other desirable properties.§ When the 
issue is homogeneity of variances, Bartlett’s test assumes the role played 
by the F test when the issue is homogeneity of means. 

Bartlett’s test, though applicable, is somewhat inappropriate when 
the alternative to homogeneity that is most likely to represent the true 
situation, or that is of special importance when it does obtain, is that the 
expected value of some one of the mean squares (variance estimates) 
substantially exceeds the expected values of the others, which may be 
homogeneous or not. By “some one” it is meant that prior to the 

* Bartlett, “Sufficiency and Statistical Tests,” especially p. 273. Also see 
Bishop and Nair, “Homogeneity of a Set of Estimated Variances”; Pitman, “Location 
and Scale Parameters”; and Hartley, “Homogeneity of a Set of Variances.” 
Bartlett’s test is described in the next section of this chapter; also in Rider, Statistical 
Methods, Chap. 7, Sec. 51, where an illustration is given. (For full citations, see 
Sec. 5.) 

† Pearson, “Efficiency of Statistical Tools,” p. 316. 

‡ Bartlett, “Examples of Statistical Methods,” especially pp. 153-159. 

§ Pitman, “Location and Scale Parameters,” especially Secs. 2 to 6; also Hartley, 
“Homogeneity of a Set of Variances.” 
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analysis of the data—or, better, prior to the collection of the data—there 
is no basis for identifying the anomalous one of the set. The several 
variance estimates may correspond, for example, to the several machines 
of a production line and represent the within-machine variability of 
products produced by the respective machines, which variability may be 
expected to be the same from machine to machine unless one of the 
machines is out of adjustment, in which event the variability of products 
from this machine will exceed that of products from other machines. 
Similarly, the several variance estimates may correspond to samples 
taken from a production line at various times during a shift and may be 
expected to be homogeneous if the process is in control but to have a 
high value if the process was out of control on one occasion. In situations 
of this type what is needed is a test of the statistical significance 
of the amount by which the largest of the variance estimates exceeds 
the others. Cochran has developed such a test and has provided a small 
table of 5 percent significance levels to facilitate its use.* 

Cochran’s test for the statistical significance of the largest of a set of 
variance estimates is described and its use illustrated in Sec. 3. Table 
15.1, which gives 5 percent levels of the test, is an extension of Cochran’s 
table. Table 15.2 gives 1 percent levels, which were not tabulated by 
Cochran. In view of the approximations used in their computation, 
many of the entries in these tables are not accurate to 4 decimal places, 
as tabulated. Their accuracy is believed to be sufficient for most prac¬ 
tical purposes, at least until more accurate values are available. For 
discussion of the methods of computation see Sec. 4. 


2. BARTLETT’S TEST 

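Let $s_1^2, s_2^2, \ldots, s_k^2$ be k variance estimates distributed independently 
as $\chi^2\sigma^2/n_i$ for $n_1, n_2, \ldots, n_k$ degrees of freedom, respectively, and $s^2$ be 
the pooled variance of the k samples 

$$s^2 = \frac{\sum_{i=1}^{k} n_i s_i^2}{n} \tag{1}$$

where 

$$n = \sum_{i=1}^{k} n_i \tag{2}$$

Bartlett’s test involves computing 

$$B = \frac{1}{C}\left(n \log_e s^2 - \sum_{i=1}^{k} n_i \log_e s_i^2\right) = \frac{2.30259}{C}\left(n \log_{10} s^2 - \sum_{i=1}^{k} n_i \log_{10} s_i^2\right) \tag{3}$$
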

* Cochran, “Largest of a Set of Estimated Variances.’’ 



where 

$$C = 1 + \frac{\sum_{i=1}^{k} \frac{1}{n_i} - \frac{1}{n}}{3(k - 1)} \tag{4}$$


Then even for quite small values of the $n_i$, say, 5 or more, B has approximately 
the $\chi^2$ distribution with k − 1 degrees of freedom. Since C > 1, 
it need not be calculated if for C = 1 the value of B is not significant.* 
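
The computation of B can be sketched as follows. This is an illustrative reading of the statistic as the quantity M = n log s² − Σ nᵢ log sᵢ² (natural logarithms, with s² the pooled variance weighted by degrees of freedom) divided by the factor C; the function name and the sample figures are hypothetical and do not come from the text.

```python
from math import log
from typing import Sequence, Tuple


def bartlett_statistic(
    variances: Sequence[float], dfs: Sequence[int]
) -> Tuple[float, float, float]:
    """Return (B, M, C) for k variance estimates with their degrees of freedom."""
    k = len(variances)
    n = sum(dfs)
    pooled = sum(ni * s2 for ni, s2 in zip(dfs, variances)) / n
    # Uncorrected statistic M, using natural logarithms.
    m = n * log(pooled) - sum(ni * log(s2) for ni, s2 in zip(dfs, variances))
    # Correction factor C, always greater than 1.
    c = 1.0 + (sum(1.0 / ni for ni in dfs) - 1.0 / n) / (3.0 * (k - 1))
    return m / c, m, c


# Hypothetical data: three variance estimates, 5 degrees of freedom each.
b, m, c = bartlett_statistic([1.0, 2.0, 4.0], [5, 5, 5])
print(round(m, 4), round(c, 4), round(b, 4))
```

For these made-up figures B is about 2.12, well below 5.991, the 5 percent point of $\chi^2$ with 2 degrees of freedom. Because C exceeds 1, M is always the larger of the two, so a value of M that is not significant settles the question without computing C.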


3. COCHRAN’S TEST 

3.1. Tables 

Let $s_1^2, \ldots, s_k^2$ be k variance estimates distributed independently 
as $\chi^2\sigma^2/n$ for n degrees of freedom each, and let 

$$g = \frac{\text{largest } s^2}{s_1^2 + s_2^2 + \cdots + s_k^2} \tag{5}$$

be the ratio of the largest $s^2$ to their total. Table 15.1 gives values 
of g to 4 decimals that are exceeded with probability P, where 
0.04875 < P < 0.05000, for n = 1 to 10, 16, 36, 144, $\infty$, and k = 2 to 10, 
12, 15, 20, 24, 30, 40, 60, 120, $\infty$. Table 15.2 gives values of g to 4 
decimals that are exceeded with probability P, where 0.00995 < P 
< 0.01000, for the same values of k and n. These two tables thus show 
the 5 percent and 1 percent levels of g. They are exact levels (to 4-decimal 
accuracy) whenever they exceed 0.5000 and in many other instances 
also. For example, a calculation of Cochran’s implies that the entries 
in Table 15.1 for n < 10 and k < 10 are exact 5 percent levels to 4-place 
accuracy. Presumably the entries of Table 15.2 for n < 10 and k < 10 
are likewise exact 1 percent levels to 4-place accuracy. Values of n and 
k exceeding 10 were selected for accuracy in interpolation. Thus n-wise 
interpolation should be carried out in terms of $12/\sqrt{n}$; k-wise interpolation, 
in terms of 120/k or 60/k. 

3.2. Example 

The following are the squares of sample standard deviations obtained
from 20 sets of six strength tests of cement mortar, each set being obtained
from a different mixing of the same cement mortar under similar conditions.*
Are these 20 squared standard deviations homogeneous? In
particular, is the standard deviation computed from the fifth sample
“out of line”?

[Table of Sample Number against (S.D.)² for the 20 samples; entries not preserved.]

* Since this chapter went to press, Thompson and Merrington have provided
“Tables for Testing the Homogeneity of a Set of Estimated Variances.” Writing
(3) above as B = M/C, they have tabled maximum and minimum values of the true
5 percent and 1 percent significance levels of M for k = 3 to 15 and for

    c_1 = 3(k - 1)(C - 1)

from 0 to 5 by steps of 0.5; 5 to 10 by steps of 1; 12; and 14.

In this case n = 5, k = 20, and g = 1,052/8,866 = 0.1187, which is
less than the corresponding 5 percent level (0.1735) given in Table 15.1.
The anomalous value of the standard deviation from the fifth sample may, 
therefore, be attributed to chance, and the 20 sets of strength tests may 
be regarded as homogeneous with respect to variability of the strength 
determinations. 
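
The arithmetic of this example can be checked with a short Python sketch. The helper name `cochran_g` is hypothetical; the figures 1,052 and 8,866 and the level 0.1735 are those quoted in the text, while the individual sample values are not reproduced here.

```python
def cochran_g(variances):
    """Ratio of the largest variance estimate to the total of all k estimates."""
    return max(variances) / sum(variances)

# Figures quoted in the example: largest (S.D.)^2 and the total over 20 samples.
g_example = 1052 / 8866
LEVEL_5_PERCENT = 0.1735      # Table 15.1 entry for n = 5, k = 20, as quoted

print(round(g_example, 4))    # 0.1187
print("homogeneous at the 5 percent level:", g_example < LEVEL_5_PERCENT)

# A common factor (such as 5/6) cancels in g, so it can be ignored.
raw = [3.0, 7.0, 5.0, 5.0]    # illustrative values only
scaled = [x * 5 / 6 for x in raw]
print(abs(cochran_g(raw) - cochran_g(scaled)) < 1e-12)
```

The last lines illustrate why a constant multiplier applied to every variance estimate does not affect the test.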

The squared standard deviations given above correspond to the
definition

    (\text{Sample standard deviation})^2 = \frac{\sum_{i=1}^{6} (x_i - \bar{x})^2}{6} \qquad (6)

where

    \bar{x} = \text{sample average} = \frac{1}{6} \sum_{i=1}^{6} x_i

the x_i (i = 1, 2, . . . , 6) being the 6 individual strength determinations
of the sample concerned, in pounds per square inch. Consequently, each
value is 5/6 of the corresponding variance estimate s², defined by

    s^2 = \frac{\sum_{i=1}^{6} (x_i - \bar{x})^2}{5} \qquad (7)

However, the factor 5/6 is common to the numerator and denominator
of g, so need not be taken into account explicitly.

4. COMPUTATION OF TABLES 

Cochran has shown† that if s_1², s_2², . . . , s_k² are variance estimates
distributed independently as χ²σ²/n for n degrees of freedom, and if the ratios

    \frac{s_i^2}{s_1^2 + s_2^2 + \cdots + s_k^2} \qquad (i = 1, 2, \ldots, k) \qquad (8)

are formed, then the probability that the largest of the ratios (8) exceeds g is given by

    P(g) = k P_1(g) - \frac{k(k-1)}{2!} P_2(g) + \frac{k(k-1)(k-2)}{3!} P_3(g) - \cdots + (-1)^{k-1} P_k(g) \qquad (9)

* Davies and Pearson, “Estimating from Samples,” Table IV, p. 92; a description
of the tests appears on p. 77.

† Cochran's r and n correspond to our n and k, respectively.

TABLE 15.1

[Tabular entries not preserved.]

For a given pair of values of k and n, the probability P that g exceeds the corresponding tabular entry satisfies the inequality
0.04875 < P ≤ 0.05000. For further details regarding accuracy of entries, see Sec. 3 of this chapter.

TABLE 15.2

Upper 1 Percent Levels of the Ratio of the Largest to the Sum of k Independent Estimates of Variance, Each of Which
Is Based on n Degrees of Freedom

[Tabular entries not preserved.]

For a given pair of values of k and n, the probability P that g exceeds the corresponding tabular entry satisfies the inequality
0.00995 < P ≤ 0.01000. For further details regarding accuracy of entries, see Sec. 3 of this chapter.

where P_1(g) is the probability that a specified one of the ratios exceeds g,
P_2(g) the probability that a specified pair of the ratios both exceed g, and
so on. Clearly

    P_j(g) = 0 \quad \text{when} \quad g \ge \frac{1}{j} \qquad (10)

since j ratios each exceeding g would have a sum exceeding jg ≥ 1, and
it is certain that the largest ratio will be as great as 1/k; so the number
of non-zero terms in (9) is the greatest integer less than 1/g.

Cochran has shown that

    P_1(g) = \frac{1}{B\left(\frac{n}{2}, \frac{n(k-1)}{2}\right)} \int_0^{1-g} x^{\frac{n(k-1)}{2} - 1} (1 - x)^{\frac{n}{2} - 1} \, dx \qquad 0 < g < 1 \qquad (11)

and has given P_2(g) and P_3(g) in multiple integral form, from which the
multiple integral form of P_j(g), j = 4, 5, . . . , k, is apparent. He has
pointed out that, since P_2(g) < [P_1(g)]² for all values of n and k, the
upper α levels of significance of g can be evaluated to a good first approximation
using only the first term of (9). Thus, if g_α is determined from

    P_1(g_\alpha) = \frac{\alpha}{k} \qquad (12)

it follows from (9) that

    \alpha \left(1 - \frac{\alpha}{2}\right) < \alpha - \frac{(k-1)\alpha^2}{2k} < P(g_\alpha) \le \alpha \qquad (13)

that is, upper 5 percent levels of g determined from (12) will in all cases
correspond to 0.04875 < P ≤ 0.05000; and, in view of (10), will be exact
when they exceed 1/2. Similarly, upper 1 percent levels determined
from (12) will correspond to 0.00995 < P ≤ 0.01000 and will be exact
when they exceed 1/2.
From (11) it follows that

    g_\alpha(n,k) = 1 - x_{\alpha/k} = \frac{F_{\alpha/k}}{F_{\alpha/k} + (k - 1)} \qquad (14)

where

    x_{\alpha/k} = x_{\alpha/k}[n, n(k - 1)] \qquad (15)

is the lower α/k level of significance of the beta-function distribution, and

    F_{\alpha/k} = F_{\alpha/k}[n, n(k - 1)] \qquad (16)

is the upper α/k level of significance of the F distribution. Hence
g_0.05(n,k) can be obtained from the tables of Thompson, or from those of
Thompson and Merrington, for k = 2, 5, and 10; and g_0.01(n,k) can be
obtained similarly for k = 2. The values of g_0.05 given in Table 15.1 for
k = 2, 5, and 10, and of g_0.01 given in Table 15.2 for k = 2, were obtained
in this manner.

When n is an even integer, (11) can be reduced to a polynomial in g.
Thus, when n = 2

    P_1(g) = (1 - g)^{k-1} \qquad (17)

and when n = 4

    P_1(g) = (1 - g)^{2(k-1)} \, [1 + 2(k - 1) g] \qquad (18)

as Cochran has pointed out. The corresponding polynomials for n = 6,
8, and 10 were determined. The values of g_0.05 and g_0.01 given in Tables
15.1 and 15.2 for n = 2, 4, 6, 8, and 10 were obtained with the aid of
these polynomials. In all other cases the values of g_0.05 and g_0.01 were
evaluated from 0.05/k and 0.01/k levels of F, which, in turn, were determined
by means of a modification of a procedure due to Paulson.
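
As an illustration of how (12) combines with the polynomials (17) and (18), the following Python sketch solves P_1(g) = α/k for n = 2 (in closed form) and n = 4 (by bisection). The function names are hypothetical, and this is not claimed to be the procedure actually used to build the tables.

```python
def p1_n2(g, k):
    """Eq. (17): P_1(g) for n = 2."""
    return (1.0 - g) ** (k - 1)

def p1_n4(g, k):
    """Eq. (18): P_1(g) for n = 4."""
    return (1.0 - g) ** (2 * (k - 1)) * (1.0 + 2.0 * (k - 1) * g)

def level_n2(alpha, k):
    """Solve (1 - g)^(k-1) = alpha/k for g in closed form."""
    return 1.0 - (alpha / k) ** (1.0 / (k - 1))

def level_n4(alpha, k, tol=1e-12):
    """Solve eq. (18) = alpha/k by bisection; P_1 decreases from 1 to 0 on [0, 1]."""
    lo, hi = 0.0, 1.0
    target = alpha / k
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if p1_n4(mid, k) > target:
            lo = mid          # probability still too large: move right
        else:
            hi = mid
    return 0.5 * (lo + hi)

print(level_n2(0.05, 2))              # 0.975
print(round(level_n4(0.05, 2), 4))
```

By (13), levels found this way are exact whenever they exceed 1/2; below that they are first approximations based on the leading term of (9) only.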

It is believed that as estimates of g_0.05 and g_0.01 the entries of Tables
15.1 and 15.2 are correct to within 1, or possibly 2, units in the fourth
decimal place. The accuracy of the tabular entries as estimates of the
true 5 percent and 1 percent levels of g is in general unknown, although
it is known that the tabular values tend to exceed the true values. It is
believed, however, that in all cases the tabular 5 percent levels correspond
to 0.04875 < P ≤ 0.05000 and the tabular 1 percent levels to 0.00995
< P ≤ 0.01000. With this understanding, Tables 15.1 and 15.2 should
be useful in practical work until more accurate tables become available.

5. REFERENCES 

Bartlett, M. S.: “Properties of Sufficiency and Statistical Tests,” Proceedings of
the Royal Society of London, Vol. 160A (1937), pp. 268-282.

Bartlett, M. S.: “Some Examples of Statistical Methods of Research in Agriculture and
Applied Biology,” Supplement to the Journal of the Royal Statistical Society,
Vol. 4 (1937), pp. 137-183.

Bishop, D. J., and Nair, U. S.: “A Note on Certain Methods of Testing for the
Homogeneity of a Set of Estimated Variances,” Supplement to the Journal of
the Royal Statistical Society, Vol. 6 (1939), pp. 89-99.

Cochran, W. G.: “The Distribution of the Largest of a Set of Estimated Variances
as a Fraction of Their Total,” Annals of Eugenics, Vol. 11 (1941), pp. 47-62.

Davies, O. L., and Pearson, E. S.: “Methods of Estimating from Samples the
Population Standard Deviation,” Supplement to the Journal of the Royal Statistical
Society, Vol. 1 (1934), pp. 76-93.

Hartley, H. O.: “Testing the Homogeneity of a Set of Variances,” Biometrika,
Vol. 31 (1939-1940), pp. 249-255.

Paulson, Edward: “An Approximate Normalization of the Analysis of Variance
Distribution,” Annals of Mathematical Statistics, Vol. 13 (1942), pp. 233-235.

Pearson, E. S., assisted by Sekar, C. Chandra: “The Efficiency of Statistical Tools
and a Criterion for the Rejection of Outlying Observations,” Biometrika, Vol.
28 (1936), pp. 308-320.

Pitman, E. J. G.: “Tests of Hypotheses concerning Location and Scale Parameters,”
Biometrika, Vol. 31 (1939-1940), pp. 200-215.

Rider, Paul R.: An Introduction to Modern Statistical Methods, John Wiley & Sons,
Inc., New York, 1939, 220 + ix pp.

Thompson, Catherine M.: “Tables of Percentage Points of the Incomplete Beta
Function,” Biometrika, Vol. 32 (1941), pp. 151-181.

Thompson, Catherine M., and Merrington, Maxine: “Tables of Percentage Points of the Inverted
Beta (F) Distribution,” Biometrika, Vol. 33 (1943), pp. 73-88.

Thompson, Catherine M., and Merrington, Maxine: “Tables for Testing the Homogeneity of a Set of Estimated
Variances,” Biometrika, Vol. 33 (1946), pp. 296-304.



CHAPTER 16

INVERSE SINE TRANSFORMATION OF PROPORTIONS

1. History of the Transformation
2. Nature of the Transformation
3. Effectiveness of the Transformation
4. Technical Notes
   4.1 General Transformation to Stabilize Variance
       4.1.1 Basic Theorem
       4.1.2 Proof of Basic Theorem
   4.2 Applications of the General Procedure
       4.2.1 Proportion of Items Possessing a Particular Attribute in a Sample
       4.2.2 Mean of a Sample from a Poisson Exponential Population
       4.2.3 Estimate of the Variance of a Normal Population Based on n Degrees of Freedom
       4.2.4 Correlation Coefficient Computed from a Sample from a Normal Population
   4.3 A Method of Discovering Bartlett’s Adjustment
5. References
   5.1 Inverse Sine Transformation
   5.2 Mathematical Tables

FIGURES

16.1 Dependence of the Variances of Three Functions of a Sample Proportion p on the Population Proportion P, When the Sample Size Is 10
16.2 Dependence of the Variances of Three Functions of a Sample Proportion p on the Population Proportion P, When the Sample Size Is 20
16.3 Dependence of the Variances of Three Functions of a Sample Proportion p on the Population Proportion P, When the Sample Size Is 30

TABLES

16.1 Means and Variances of Various Functions of a Sample Proportion for Selected Values of the Population Proportion When the Sample Size Is 10
16.2 Means and Variances of Various Functions of a Sample Proportion for Selected Values of the Population Proportion When the Sample Size Is 20
16.3 Means and Variances of Various Functions of a Sample Proportion for Selected Values of the Population Proportion When the Sample Size Is 30
16.4 Values of φ_B(0) and φ_B(1) for Samples of 10 to 50 Items Together with Values of the Variance of φ_B When P = 0.5


This chapter was prepared by 
Churchill Eisenhart, National Bureau of Standards 



CHAPTER 16 

INVERSE SINE TRANSFORMATION OF PROPORTIONS 


1. HISTORY OF THE TRANSFORMATION 
If successive samples of n items each are drawn independently at 
random from a very large (infinite) population in which a proportion P 
of the items possess some definite attribute A, then the proportions 
p_1, p_2, p_3, . . . of items possessing A in the successive samples will be
distributed in such a manner that as the number of independent samples 
is increased without limit the average of the p’s will tend to P and the 
mean of their squared deviation from P will tend to P(1 — P)/n. In 
the language of statistics this is expressed by saying that, if a sample of n 
items is drawn at random from an infinite population in which a propor¬ 
tion P of the items possess a definite attribute A, and if the proportion of 
items possessing A in the sample is denoted by p, then 



    \text{Expected value of } p = E(p) = P \qquad (1)

and

    \text{Variance of } p = V(p) = \frac{P(1 - P)}{n} \qquad (2)

or

    \text{Standard deviation of } p = \sigma_p = \sqrt{\frac{P(1 - P)}{n}} \qquad (3)


Equation (3) or equation (2), considered in conjunction with equa¬ 
tion (1), states that observed proportions p, based on successive inde¬ 
pendent random samples of size n, may be expected to be grouped more 
closely about the true proportion P when n is large than when n is small; 
for fixed sample size, their sampling variation about P will be greatest 
when P equals 1/2 and will decrease toward zero as either P or 1 − P
approaches zero. This dependence of the sampling variation of an 
observed proportion p upon the true proportion that it estimates, P, is 
a source of inconvenience in certain kinds of theoretical work and in some 
types of statistical analysis of research data. Basically, the incon¬ 
venience is a consequence of the fact that for purposes of statistical 
inference (for example, estimation of parameters, tests of agreement 
between observation and hypothesis) the worth or “weight” of an indi¬ 
vidual observation (here, a single observed proportion p) is generally 
inversely proportional to its sampling variance. Accordingly, the formulas
of statistical theory and the calculations of statistical analysis are
simpler, in general, when the observations involved are of equal weight, 
that is, of equal precision. 

Recognizing this fact, R. A. Fisher, in a paper on theoretical genetics 
published in 1922,* introduced the transformation cos φ = 1 − 2p,
where φ varies from 0 to π as p varies from 0 to 1, and utilized the fact
that, when φ is computed from large samples, the variance of φ is, for all
practical purposes, independent of the true proportion P.

Fisher employed this transformation to simplify the mathematical 
equations occurring in an application of statistical theory to a problem in 
genetics. In 1930 he published another paper on this problem† in
which he considered the bias of the transformation in large samples 
and showed that his failure to make allowance for this bias in his earlier 
paper accounted for the discrepancy between one of the results therein 
and a solution to the same problem obtained in a different manner by 
Sewall Wright. Not until 1936 were the possibilities of this transforma¬ 
tion in connection with statistical analysis of research data discussed in 
print, when three writers indicated advantages to be gained from its use 
in statistical analyses of data expressed in the form of observed proportions
(or percentages of items counted). Bartlett,‡ on a suggestion from
Yates, Fisher’s coworker at the Rothamsted Experimental Station,
considered the use of a related transformation (sin² θ = p, where θ varies
from 0 to π/2; see below) as a means of rendering proportions amenable
to analysis-of-variance techniques, which assume equal precision of the
observations involved. With the same objective, Bliss,§ after some
correspondence with Fisher, recommended the use of the angular transformation
sin² θ = p in the statistical analysis of field-trial data expressed
in percentages (that is, 100 times the proportion of items with a given
attribute in a sample) and gave an extensive table of θ (in degrees) for
percentages (that is, 100p) from 0.01 to 99.99. Zubin|| pointed out the
advantages of using Fisher’s transformation as a means of simplifying
the calculations involved in testing the statistical significance of a difference
between two observed proportions, his attention having been
drawn to Fisher’s papers by Hotelling. Zubin, utilizing this transformation,
subsequently prepared nomographs¶ by means of which the statistical
significance of the difference between two observed proportions can
be determined almost at a glance. These nomographs jointly are, in a
sense, the ancestors of the nomograph presented in Chap. 7 above.

* Fisher, “On the Dominance Ratio,” p. 326. (For full citations, see Sec. 5.1.)

† Fisher, “The Distribution of Gene Ratios.”

‡ Bartlett, “Square Root Transformation in Analysis of Variance,” p. 73.

§ Bliss, “The Analysis of Field Experimental Data Expressed in Percentages.”

|| Zubin, “Note on a Transformation Function for Proportions and Percentages.”

¶ Zubin, “Nomographs for Determining the Significance of the Difference.”

Since 1936 a great deal has been written on the transformation of
proportions to angles, to facilitate or validate various forms of statistical
analysis. A fairly complete bibliography is given at the end of this
chapter; it includes references to tables that have been published to
facilitate the angular transformations in current use.

The remainder of this chapter is devoted to a discussion of the effectiveness
of Fisher's transformation in small samples, especially the
dependence on P of the expected value and the variance of φ. A simple
adjustment, due to Bartlett,* is discussed, and a table (Table 16.4) of
adjusted values provided to facilitate its use. More elaborate adjustments,
determined by the data themselves in any given case, have been
discussed by Cochran,† and tables‡ are available to facilitate their use.
In the last section of this chapter the general method of determining
transformations to stabilize variance is outlined, the angular transformation
of Fisher derived, and several other such transformations noted. A
method of discovering Bartlett's adjustment is noted, which may be useful
in other situations.


2. NATURE OF THE TRANSFORMATION 

Let p denote the proportion of items possessing some particular attri¬ 
bute in a sample of n items drawn independently at random from a very 
large (infinite) population of items in which a proportion P of the items 
possess the attribute in question. Under these circumstances the mean, 
variance, and standard deviation of p are functions of P, given by equations
(1), (2), and (3). Let φ be an angle (in radians) determined by the
transformation

    \varphi = 2 \arcsin \sqrt{p} = \arccos (1 - 2p) \qquad 0 \le p \le 1 \quad 0 \le \varphi \le \pi \qquad (4)

the inverse of which is

    p = \sin^2 \frac{\varphi}{2} = \frac{1 - \cos \varphi}{2} \qquad 0 \le p \le 1 \quad 0 \le \varphi \le \pi \qquad (5)

Then, when the number of observations n upon which p is based is large,

    \text{Expected value of } \varphi = E(\varphi) = \varphi(P) \text{ approximately} \qquad (6)

where

    \varphi(P) = 2 \arcsin \sqrt{P} \qquad (7)

and

    \text{Variance of } \varphi = V(\varphi) = \frac{1}{n} \text{ approximately*} \qquad (8)

* Bartlett, “Some Examples of Statistical Methods,” p. 168.
† Cochran, “The Analysis of Variance When Experimental Errors Follow the
Poisson or Binomial Laws,” pp. 342 ff.
‡ Fisher and Yates, Statistical Tables.

An elementary derivation of this transformation is given in Sec. 4. This
is the form in which the transformation was introduced by Fisher in 1922;
the transformation has been employed in this form by Zubin and, with
certain modifications, by the Statistical Research Group.†

Other writers, following Bartlett and Bliss, have transformed p to an
angle θ equal to φ/2, whence V(θ) = 1/4n approximately for θ in radians
or 821/n approximately for θ in degrees. Thus, when θ is expressed in
radians, V(θ) has, for a wide range of values of P, approximately the
value of V(p) when P = 1/2. This advantage, if such it is, is lost, however,
when θ is expressed in degrees.
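
The transformation (4), its inverse (5), and the large-sample variance constants quoted above can be sketched in Python as follows. The function names are hypothetical; only the standard `math` module is used.

```python
import math

def phi(p):
    """Fisher's angular transformation, eq. (4): phi = 2 arcsin sqrt(p), in radians."""
    return 2.0 * math.asin(math.sqrt(p))

def phi_inverse(angle):
    """Inverse transformation, eq. (5): p = sin^2(phi / 2)."""
    return math.sin(angle / 2.0) ** 2

# Large-sample variance constants (each to be divided by the sample size n).
RAD_TO_DEG = 180.0 / math.pi
var_phi_radians = 1.0                      # V(phi)   ~ 1/n
var_phi_degrees = RAD_TO_DEG ** 2          # V(phi)   ~ 3282.8/n
var_theta_radians = 0.25                   # V(theta) ~ 1/(4n), theta = phi/2
var_theta_degrees = RAD_TO_DEG ** 2 / 4.0  # V(theta) ~ 821/n

print(round(phi(0.05), 4))           # 0.451
print(round(var_phi_degrees, 1))     # 3282.8
print(round(var_theta_degrees))      # 821
```

The two forms in (4) agree because cos φ = 1 − 2 sin²(φ/2) = 1 − 2p.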

TABLE 16.1

Means and Variances of Various Functions of a Sample Proportion for
Selected Values of the Population Proportion When the Sample Size
Is 10

  P        φ(P)     E(p)     40V(p)   E(φ)     10V(φ)   E(φ_B)   10V(φ_B)
  0.0500   0.4510   0.0500   0.1900   0.2856   1.2917   0.4757   0.4481
  0.1000   0.6435   0.1000   0.3600   0.5134   1.6567   0.6241   0.7489
  0.1500   0.7954   0.1500   0.5100   0.7006   1.6529   0.7632   0.9363
  0.2000   0.9273   0.2000   0.6400   0.8598   1.5332   0.8939   1.0436
  0.3000   1.1593   0.3000   0.8400   1.1260   1.2968   1.1350   1.1224
  0.5000   1.5708   0.5000   1.0000   1.5708   1.1433   1.5708   1.1258
  0.7000   1.9823   0.7000   0.8400   2.0156   1.2968   2.0066   1.1224
  0.8000   2.2143   0.8000   0.6400   2.2818   1.5332   2.2477   1.0436
  0.8500   2.3462   0.8500   0.5100   2.4410   1.6529   2.3784   0.9363
  0.9000   2.4981   0.9000   0.3600   2.6282   1.6567   2.5175   0.7489
  0.9500   2.6906   0.9500   0.1900   2.8560   1.2917   2.6659   0.4481

Explanation

Let  P denote the proportion of items possessing a definite characteristic in an
     infinite population
     p denote the proportion of items possessing this characteristic in a sample of
     10 items drawn independently at random from the population
Then φ(p) = 2 arcsin √p = arccos (1 − 2p), 0 ≤ p ≤ 1, 0 ≤ φ ≤ π
     φ_B(p) = φ(p) for 0 < p < 1, φ_B(0) = φ(1/40), φ_B(1) = π − φ_B(0)
and  E( ) denotes average or expected value of the quantity within the parentheses
     V( ) denotes variance of the quantity within the parentheses.


* If φ is measured in degrees, V(φ) = 3282.8/n.

† Sequential Analysis, Sec. 3.6; also Chap. 7 of this volume.

3. EFFECTIVENESS OF THE TRANSFORMATION

Apparently the only published investigation of the effectiveness of the
inverse sine transformation in small samples is that of Bartlett* for the
case of n = 10. His analysis shows that when n = 10, the variance of φ
depends very markedly on the true proportion P. A year later he
remarked† that a good correction is to replace p = 0 by p = 1/4n, and
p = 1 by p = 1 − 1/4n, although he gave no supporting evidence.

Tables 16.1 to 16.3 show, for n = 10, 20, and 30, respectively, the
values to 4 decimal places of E(p), 4nV(p), φ(P), E(φ), nV(φ), E(φ_B),
and nV(φ_B) corresponding to P = 0.05 to 0.20 by steps of 0.05, 0.30,
0.50, 0.70, and 0.80 to 0.95 by steps of 0.05, where φ_B denotes the angular
transformation (4) modified as suggested by Bartlett, that is,

    \varphi_B(p) = 2 \arcsin \sqrt{p} = \arccos (1 - 2p) \qquad 0 < p < 1 \quad 0 < \varphi_B < \pi
    \varphi_B(0) = 2 \arcsin \sqrt{1/4n}, \qquad \varphi_B(1) = \pi - \varphi_B(0) \qquad (9)
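
The kind of direct calculation behind Tables 16.1 to 16.3 can be sketched in Python by summing over the binomial distribution of the number s of items possessing the attribute. The function names below are hypothetical, and the sketch is an illustration of definition (9), not a reconstruction of the original computing procedure.

```python
import math

def phi(s, n):
    """Unadjusted transformation (4) applied to p = s/n."""
    return 2.0 * math.asin(math.sqrt(s / n))

def phi_b(s, n):
    """Bartlett's adjusted transformation (9): p = 0 and p = 1 are replaced."""
    if s == 0:
        return 2.0 * math.asin(math.sqrt(1.0 / (4.0 * n)))
    if s == n:
        return math.pi - 2.0 * math.asin(math.sqrt(1.0 / (4.0 * n)))
    return phi(s, n)

def exact_mean_var(n, P, transform):
    """Exact mean and variance of transform(s, n) for s ~ Binomial(n, P)."""
    probs = [math.comb(n, s) * P**s * (1.0 - P) ** (n - s) for s in range(n + 1)]
    values = [transform(s, n) for s in range(n + 1)]
    mean = sum(pr * v for pr, v in zip(probs, values))
    var = sum(pr * (v - mean) ** 2 for pr, v in zip(probs, values))
    return mean, var

n = 10
for P in (0.05, 0.5):
    m, v = exact_mean_var(n, P, phi)
    mb, vb = exact_mean_var(n, P, phi_b)
    print(f"P={P}: E(phi)={m:.4f} nV(phi)={n*v:.4f} "
          f"E(phi_B)={mb:.4f} nV(phi_B)={n*vb:.4f}")
```

For n = 10 the printed values can be compared with the corresponding rows of Table 16.1.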

TABLE 16.2

Means and Variances of Various Functions of a Sample Proportion for
Selected Values of the Population Proportion When the Sample Size
Is 20

  P        φ(P)     E(p)     80V(p)   E(φ)     20V(φ)   E(φ_B)   20V(φ_B)
  0.0500   0.4510   0.0500   0.1900   0.3541   1.6315   0.4344   0.7247
  0.1000   0.6435   0.1000   0.3600   0.5866   1.5458   0.6139   1.0138
  0.1500   0.7954   0.1500   0.5100   0.7602   1.3240   0.7688   1.0974
  0.2000   0.9273   0.2000   0.6400   0.9035   1.1810   0.9061   1.1051
  0.3000   1.1593   0.3000   0.8400   1.1469   1.0858   1.1471   1.0784
  0.5000   1.5708   0.5000   1.0000   1.5708   1.0562   1.5708   1.0561
  0.7000   1.9823   0.7000   0.8400   1.9947   1.0858   1.9945   1.0784
  0.8000   2.2143   0.8000   0.6400   2.2381   1.1810   2.2355   1.1051
  0.8500   2.3462   0.8500   0.5100   2.3814   1.3240   2.3728   1.0974
  0.9000   2.4981   0.9000   0.3600   2.5550   1.5458   2.5277   1.0138
  0.9500   2.6906   0.9500   0.1900   2.7875   1.6315   2.7072   0.7247

Explanation

Let  P denote the proportion of items possessing a definite characteristic in an
     infinite population
     p denote the proportion of items possessing this characteristic in a sample of
     20 items drawn independently at random from the population
Then φ(p) = 2 arcsin √p = arccos (1 − 2p), 0 ≤ p ≤ 1, 0 ≤ φ ≤ π
     φ_B(p) = φ(p) for 0 < p < 1, φ_B(0) = φ(1/80), φ_B(1) = π − φ_B(0)
and  E( ) denotes average or expected value of the quantity within the parentheses
     V( ) denotes variance of the quantity within the parentheses.

* Bartlett, “Square Root Transformation in Analysis of Variance,” pp. 74-75.
† Bartlett, “Some Examples of Statistical Methods,” p. 168.

The mean values and variances of φ and φ_B were determined by direct
calculation.*

If the inverse sine transformation completely stabilized variance in
small samples, then nV(φ) would be unity in all cases, or at least would
be constant for a given value of n. That this is not the case is clear
from Tables 16.1 to 16.3. Evidently, nV(φ_B) is somewhat less variable
than nV(φ) over the range of values of P considered. The nature of
the dependence of nV(φ) and nV(φ_B) on P is depicted graphically in
Figs. 16.1 to 16.3, for n = 10, 20, and 30, respectively. For purposes of
comparison 4nV(p) has been plotted also. The following conclusions
may be drawn:

(1) For n ≥ 10 and 0.05 ≤ P ≤ 0.95, nV(φ) and nV(φ_B) are less
variable than 4nV(p).


TABLE 16.3

Means and Variances of Various Functions of a Sample Proportion for
Selected Values of the Population Proportion When the Sample Size
Is 30

  P        φ(P)     E(p)     120V(p)  E(φ)     30V(φ)   E(φ_B)   30V(φ_B)
  0.0500   0.4510   0.0500   0.1900   0.3885   1.6443   0.4278   0.8985
  0.1000   0.6435   0.1000   0.3600   0.6119   1.3359   0.6197   1.0921
  0.1500   0.7954   0.1500   0.5100   0.7757   1.1560   0.7771   1.0987
  0.2000   0.9273   0.2000   0.6400   0.9134   1.0885   0.9136   1.0773
  0.3000   1.1593   0.3000   0.8400   1.1516   1.0484   1.1516   1.0481
  0.5000   1.5708   0.5000   1.0000   1.5708   1.0358   1.5708   1.0358
  0.7000   1.9823   0.7000   0.8400   1.9900   1.0484   1.9900   1.0481
  0.8000   2.2143   0.8000   0.6400   2.2282   1.0885   2.2280   1.0773
  0.8500   2.3462   0.8500   0.5100   2.3659   1.1560   2.3645   1.0987
  0.9000   2.4981   0.9000   0.3600   2.5297   1.3359   2.5219   1.0921
  0.9500   2.6906   0.9500   0.1900   2.7531   1.6443   2.7138   0.8985

Explanation

Let  P denote the proportion of items possessing a definite characteristic in an
     infinite population
     p denote the proportion of items possessing this characteristic in a sample of
     30 items drawn independently at random from the population
Then φ(p) = 2 arcsin √p = arccos (1 − 2p), 0 ≤ p ≤ 1, 0 ≤ φ ≤ π
     φ_B(p) = φ(p) for 0 < p < 1, φ_B(0) = φ(1/120), φ_B(1) = π − φ_B(0)
and  E( ) denotes average or expected value of the quantity within the parentheses
     V( ) denotes variance of the quantity within the parentheses.


* Binomial probabilities to 7 decimals were obtained from Tables of the Incomplete
Beta Function; arcsin values to 6 decimals were obtained by interpolation in Hayashi’s
Sieben- und mehrstellige Tafeln, the entries of this latter table being corrected in some
cases in accordance with the list of corrections given in Mathematical Tables and Other
Aids to Computation, Vol. 1, No. 6, p. 197.

(2) For n ≥ 20 and 0.05 ≤ P ≤ 0.95, nV(φ_B) is less variable than
nV(φ).

(3) For n ≥ 10 and 0.075 ≤ P ≤ 0.925, nV(φ_B) is less variable than
nV(φ).

Since it is quite apparent from the foregoing that the inverse sine
transformation, especially with Bartlett’s correction, is advantageous
from the viewpoint of variance stabilization, its effectiveness in this
regard increasing rapidly as the sample size n is increased, the question
arises whether estimates of P obtained by transforming back to proportions
after averaging in angular units will be biased. A comparison
of the values of φ(P), E(φ), and E(φ_B) shown in Tables 16.1 to 16.3
reveals that E(φ) and E(φ_B) generally are less than φ(P) when P < 1/2,
greater than φ(P) when P > 1/2, and the three are equal when P = 1/2.*
Furthermore, for all values of n and P considered in these tables, E(φ_B)
is the closer to φ(P). From Tables 16.1 to 16.3 it is seen that, when
0.05 ≤ P ≤ 0.95, the bias E(φ_B) − φ(P) is less than 15 per cent of the
standard deviation of φ_B when n ≥ 10 and thus is of negligible magnitude
for most practical purposes. Thus Bartlett's correction, in addition to
its advantage from the viewpoint of stabilizing variance, tends to reduce
to negligible magnitudes the biases of estimates of P obtained by reconverting
averages expressed in angular units to estimated mean proportions
with the aid of equation (5).

* The values of E(φ_B) for P = 0.05 and P = 0.95 when n = 10 are exceptions.
Such exceptions can be expected to occur with φ_B in those instances in which the p = 0
or p = 1 class has a large probability of occurrence, that is, when P is close to 0 or 1
and n is small.

Fig. 16.1. Dependence of the variances of three functions of a sample proportion p on the
population proportion P when the sample size is 10. [Curves plotted against P:
V_1 = 4P(1 − P) = 4 × 10 × V(p); V_2 = 10 V(φ); V_3 = 10 V(φ_B), with φ_B(0) = φ(1/40).]

Fig. 16.2. Dependence of the variances of three functions of a sample proportion p on the
population proportion P when the sample size is 20. [Curves plotted against P:
V_1 = 4P(1 − P) = 4 × 20 × V(p); V_2 = 20 V(φ); V_3 = 20 V(φ_B), with φ_B(0) = φ(1/80).]

Values of φ_B(0) and φ_B(1), expressed in radians, have been tabulated in
Table 16.4 for sample sizes from 10 to 50, in order to facilitate the use
of Bartlett’s correction. The values of φ_B(0) and φ_B(1) given in Table
16.4 may be used in conjunction with Zubin’s “Table for Transforming
Proportions to T values”* without modification; they should be divided
by 2 before being used in conjunction with Bartlett’s table†; they should
be multiplied by 28.64789 before being used in conjunction with the
tables of Bliss, Fisher and Yates, and Snedecor.

Fig. 16.3. Dependence of the variances of three functions of a sample proportion p on the
population proportion P when the sample size is 30. [Curves plotted against P:
V_1 = 4P(1 − P) = 4 × 30 × V(p); V_2 = 30 V(φ); V_3 = 30 V(φ_B), with φ_B(0) = φ(1/120).]

A previous publication of the Statistical Research Group gives a 
tablet of the transformation 

* Zubin, “Note on a Transformation,” Table 4, p. 220. 
t Bartlett, “Subsampling for Attributes,” Table II, p. 132. 
t Sequential Analysis, Table 3.64, p. 3.67. 
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y(s,n) = 2 arcsin y/$/n 0<8<n,0<y<r 

2 /( 0 , n) = \/2 Jn for s = 0 (10) 

y(n,n) = tt — y/2jn for $ = n 

for values of n from 10 to 50. Comparison of the foregoing with equation 
(9) above reveals that y(s) = <pb(p) with p = s/n for 0 < s < n; 
2 /( 0 ) = \/2<pb(0) approximately and y(n) = tt — \/2^(0) approxi¬ 
mately. Since the publication of this table of the y transformation, 
it has been found that Bartlett’s adjustments, (p B ( 0) and <p B ( 1), are 
slightly more effective for stabilizing variance, especially when P is 

TABLE 16.4 

Values of *>b(0) and <pb(1) foe Samples of 10 to 50 Items Together with Values 
of the Variance of <pb When P = 0.5 


Sample tp B ( 0), 
size, n radians 



2.8519 

2.8633 


0.0918 

0.0840 


0.2589 

2.8827 

0.0719 

0.2507 

2.8909 

0.0670 

0.2431 

2.8985 

0.0628 

0.2363 

2.9053 

0.0591 

0.2299 

2.9117 

0.0558 

0.2241 

2.9175 

0.0528 

0.2187 

2.9229 

0.0501 

0.2136 

2.9280 

0.0477 

0.2089 

2.9327 

0.0456 

0.2045 

2.9371 

0.0436 

0.2003 

2.9413 

0.0417 

0.1964 

2.9452 

0.0401 

0.1927 

2.9488 

0.0385 

0.1893 

2.9523 

0.0371 

0.1860 

2.9556 

0.0358 





■ 





1 











0.1692 

2.9724 

0.0294 

0.1669 

2.9747 

0.0286 

0.1646 

2.9770 

0.0278 

0.1624 

2.9792 

0.0270 

0.1603 

2.9813 

0.0263 

0.1583 

2.9833 

0.0257 

0.1563 

2.9853 

0.0250 

0.1545 

2.9871 

0.0244 

0.1526 

2.9889 

0.0238 

0.1509 

2.9907 

0.0233 

0.1492 

2.9924 

0.0227 

0.1476 

2.9940 

0.0222 

0.1460 

2.9956 

0.0217 

0.1445 

2.9971 

0.0213 

0.1430 

2.9986 


0.1415 




Explanation

Let P denote the proportion of items possessing a definite characteristic in an
infinite population, and let p denote the proportion of items possessing this
characteristic in a sample of n items drawn independently at random from the
population. Then

$$
\varphi_B(p) = 2 \arcsin \sqrt{p} = \arccos (1 - 2p), \qquad 0 < p < 1,\ 0 < \varphi < \pi
$$

$$
\varphi_B(0) = 2 \arcsin \sqrt{1/4n}; \qquad \varphi_B(1) = \pi - \varphi_B(0)
$$

and $V_{0.5}(\varphi_B)$ denotes the variance of $\varphi_B$ when P = 0.5.

Note: When n > 50, $\varphi_B(0) = 2\sqrt{1/4n}$, $\varphi_B(1) = \pi - \varphi_B(0)$, and
$V_{0.5}(\varphi_B) = 1/n + 1/n^2$ are correct to within 1 unit in the fourth decimal place.
















near 0.05 or 0.95, than the simpler adjustments $y(0,n)$ and $y(n,n)$.
It is recommended, therefore, that, when special accuracy is desired,
the values of $\varphi_B(0)$ and $\varphi_B(1)$ given in Table 16.4 of the present chapter be
used in place of the values of $y(0,n)$ and $y(n,n)$ shown in the table of the
y transformation.

From Figs. 16.1 to 16.3 it is clear that, for a fairly wide range of values
of P about P = 0.5, the variance of $\varphi_B$, $nV(\varphi_B)$ being plotted as Vz in
these figures, is quite close to its value for P = 0.5, say $V_{0.5}(\varphi_B)$, and
that $V_{0.5}(\varphi_B)$ approaches $1/n$ as n increases. Indeed, from a consideration
of these three cases (n = 10, 20, 30) it appears that


$$
V_{0.5}(\varphi_B) = \frac{1}{n} + \frac{0.974}{n^2} + \frac{3.08}{n^3} - \frac{2.4}{n^4} \tag{11}
$$


Using this formula, values of $V_{0.5}(\varphi_B)$ were computed for intermediate
values of n, that is, for 10 < n < 20 and 20 < n < 30, and, in the
absence of any other formula giving correct values for n = 10, 20, and
30, this formula was used to compute values of $V_{0.5}(\varphi_B)$ for n from 31
to 50. These values are tabulated, together with the known values for
n = 10, 20, and 30, in Table 16.4 and provide, it is believed, a reasonably
accurate picture of the convergence of $V(\varphi_B)$ toward its theoretical value
of $1/n$. The values of $V_{0.5}(\varphi_B)$ given in Table 16.4 are slightly larger than
the values of the variance of y given in the table of the y transformation
referred to above, but for n > 25 the excess is very slight, being roughly
1 unit or less in the fourth decimal place.
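
The tabulated quantities can be recomputed directly. The following is a minimal sketch, not part of the original text: the function names are hypothetical, and the signs in formula (11) are taken as plus, plus, minus, which is the combination that reproduces the legible entries of Table 16.4.

```python
import math


def phi_b0(n: int) -> float:
    """Bartlett's adjusted angle for p = 0: 2*arcsin(sqrt(1/(4n))), in radians."""
    return 2.0 * math.asin(math.sqrt(1.0 / (4.0 * n)))


def phi_b1(n: int) -> float:
    """Bartlett's adjusted angle for p = 1: pi - phi_B(0), in radians."""
    return math.pi - phi_b0(n)


def v05(n: int) -> float:
    """Empirical formula (11) for the variance of phi_B when P = 0.5.

    The signs of the terms are an assumption inferred from the table.
    """
    return 1.0 / n + 0.974 / n**2 + 3.08 / n**3 - 2.4 / n**4


if __name__ == "__main__":
    for n in (15, 25, 40):
        print(n, round(phi_b0(n), 4), round(phi_b1(n), 4), round(v05(n), 4))
```

Comparing the printed rows with the rows for n = 15, 25, and 40 in Table 16.4 gives a quick check on both the table and the formula.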

Fisher and Yates have suggested that, when special accuracy is 
desired, observed angles be replaced by adjusted angular values in much 
the same manner as observed probits are replaced by adjusted probits. 
They have given a table to facilitate such adjustments.* The adjust¬ 
ments proposed by Fisher and Yates are functions of the actual propor¬ 
tions observed in a given instance and thus are of a wholly different type 
from Bartlett’s adjustment, which may be regarded as a convenient rule 
of thumb. The theoretical basis of the adjustments advocated by 
Fisher and Yates has been given in a recent paper by Cochran.†

Cochran has given an excellent discussion‡ of the presentation of
results of statistical analyses in which transformations of original data 
have been employed to facilitate their statistical analysis. Curtiss§ has 
provided a general mathematical theory of transformations of data to 
render their analysis more tractable. 

* Fisher and Yates, Statistical Tables, Table XIV.

† Cochran, “The Analysis of Variance When Experimental Errors Follow the
Poisson or Binomial Laws,” p. 342.

‡ Cochran, “Analysis of Replicated Experiments,” p. 168 and p. 170.

§ Curtiss, “On Transformations.”
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In conclusion, it must be remembered that while in large samples the 
variance of a transformed proportion is independent of the true propor¬ 
tion for most practical purposes, the variance of the transformed value <p 
is still proportional to 1/n, where n is the number of observations upon 
which the observed proportion is based. Consequently, if $p_1, p_2, p_3, \ldots$
are proportions based on different numbers of observations $n_1, n_2, n_3, \ldots$
and if the n's differ widely, then the variances of the corresponding angular
values may be so unequal, because of the variation of the n's, that the
advantages of the transformation are lost almost entirely. Similarly, the
advantages of the inverse sine transformation may be lost to a large extent
when the data to be analyzed are subject to other forms of variability in
addition to sampling variation so that the variance of an observed proportion
p is given by $P(1 - P)/n + \sigma_e^2$, where the first term represents
the variation of simple sampling and the second term the extraneous 
variation. In a recent paper* Cochran has discussed the analysis of 
fractions or percentages based on different numbers of observations (a) 
when extraneous variation is absent and ( b ) when extraneous variation 
is present. 

4. TECHNICAL NOTES 

4.1. General Transformation to Stabilize Variance 


4.1.1. Basic Theorem. —The transformation of random variables to 
stabilize variance may be based upon the following theorem: 

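Let $\theta$ $(\theta_1 < \theta < \theta_2)$ be a parameter of a statistical universe
(population), and let $T$ be an estimator of $\theta$ $[T_1(\theta) < T < T_2(\theta)]$, based on
n observations from the universe such that

$$
\begin{aligned}
\text{Expected value of } T &= E(T) = \theta \\
\text{Variance of } T &= V(T) = E(T - \theta)^2 = \frac{g(\theta)}{n} \\
\text{Third moment of } T \text{ about } \theta &= \mu_3(T) = E(T - \theta)^3 = o\!\left(\frac{1}{n}\right) \quad \text{as } n \to \infty\,\dagger \\
\text{Fourth moment of } T \text{ about } \theta &= \mu_4(T) = E(T - \theta)^4 = o\!\left(\frac{1}{n}\right) \quad \text{as } n \to \infty
\end{aligned}
\tag{12}
$$
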

* Cochran, “Analysis of Variance for Percentages Based on Unequal Numbers.”

† By $\varphi(n) = o(n^k)$ as $n \to \infty$ we mean that $\lim_{n \to \infty} \varphi(n)/n^k = 0$.
In particular, $o(1)$ means a function that tends to zero as $n \to \infty$.

By $\varphi(n) = O(n^k)$ as $n \to \infty$ we mean that $|\varphi(n)| < K \cdot n^k$ for $n > n_0$, where K
is finite and does not depend upon n, and $n_0$ is some (large) value of n. In particular,
$O(1)$ means a function that is bounded as $n \to \infty$.

This notation is in agreement with that introduced by E. Landau in 1909 and is
now well established.
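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If the function

$$
f(T) = \int^{T} \frac{d\theta}{\sqrt{g(\theta)}} \tag{13}
$$

and its first two derivatives exist for all admissible values of T, except
perhaps for a set of measure zero, and if the mean value of its second
derivative and of the square of the second derivative are finite for
$\theta_1 < \theta < \theta_2$ and every value of n, that is, if

$$
E[f^{(2)}(T)] = O(1) \quad \text{and} \quad E[f^{(2)}(T)]^2 = O(1) \quad \text{as } n \to \infty \tag{14}
$$

for $\theta_1 < \theta < \theta_2$ and $T_1(\theta) < T < T_2(\theta)$, then

$$
V[f(T)] = \frac{1}{n} + o\!\left(\frac{1}{n}\right) \quad \text{as } n \to \infty \tag{15}
$$
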

The first of assumptions (12) implies that T is an unbiased estimator
of $\theta$. The three further assumptions are introduced for the purposes of
the theorem and are not very restrictive. Indeed, assumptions (12) are 
generally satisfied when T is some sort of average that has been adjusted 
so as to be an unbiased estimator of a population parameter of interest. 

4.1.2. Proof of Basic Theorem.—Let $f(T)$ be a function of T such
that its second derivative $f^{(2)}(T)$ exists for $T_1(\theta) < T < T_2(\theta)$, except
perhaps for a set of points of measure zero, where $T_1(\theta) < T < T_2(\theta)$
is the admissible range of values of T, that is, $P[T_1(\theta) < T < T_2(\theta) \mid \theta] = 1$.
Since $E(T) = \theta$ by assumption, it follows that $\theta$ lies in the admissible
range of values for T. Hence $f(T)$ may be expressed as follows, with
the aid of Taylor's formula and Lagrange's form of the remainder term:

$$
f(T) = f(\theta) + (T - \theta)\, f^{(1)}(\theta) + \frac{(T - \theta)^2}{2!}\, f^{(2)}(t) \tag{16}
$$

where $t = \theta + \xi_T (T - \theta)$, $0 < \xi_T < 1$, and the notation $\xi_T$ is used to
emphasize the fact that the value of $\xi$ depends upon the value of T concerned,
$T_1(\theta) < T < T_2(\theta)$.

Taking the mathematical expectation of each side of equation (16),
it is found that

$$
E[f(T)] = f(\theta) + \frac{V(T)}{2} \cdot E[f^{(2)}(t)] = f(\theta) + o(1) \quad \text{as } n \to \infty \tag{17}
$$

by virtue of assumption (14) and the second of assumptions (12). This
shows that these assumptions, together with those regarding the existence
and differentiability of $f(T)$, are sufficient to ensure that, as n
increases without limit, the expected value of the function $f(T)$ converges
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stochastically to $f(\theta)$, the value of the function when T assumes its
expected value.

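Subtracting the left and right sides of (17) from the left and right
sides, respectively, of (16), squaring, and taking expected values, it is
found that

$$
\begin{aligned}
V[f(T)] &= E\{f(T) - E[f(T)]\}^2 \\
&= [f^{(1)}(\theta)]^2 \cdot V(T) + \tfrac{1}{4} E[f^{(2)}(t)]^2 \cdot [\mu_4(T) - V^2(T)] + f^{(1)}(\theta) \cdot E[f^{(2)}(t)] \cdot \mu_3(T)
\end{aligned}
\tag{18}
$$

In view of the assumptions (12) and (14), this becomes

$$
V[f(T)] = [f^{(1)}(\theta)]^2 \cdot V(T) + o\!\left(\frac{1}{n}\right) = [f^{(1)}(\theta)]^2 \cdot \frac{g(\theta)}{n} + o\!\left(\frac{1}{n}\right) \quad \text{as } n \to \infty \tag{19}
$$

Consequently, for the function defined by equation (13), for which
$f^{(1)}(\theta) = 1/\sqrt{g(\theta)}$, equation (19) reduces to equation (15). This
completes the proof.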

Curtiss* has considered somewhat more fully the mathematical 
aspects of transformations of random variables to simplify their variances, 
and his paper should be consulted by persons interested in a compre¬ 
hensive mathematical treatment. 

4.2. Applications of the General Procedure 

4.2.1. Proportion of Items Possessing a Particular Attribute in a 
Sample. —Let $\theta = P$, the proportion of items in an infinite population
that possess a particular characteristic A, and let $T = p$, the proportion
of items possessing A in a sample of n items drawn independently at
random from the population. It is well known that

$$
\begin{aligned}
E(p) &= P \\
V(p) &= \frac{P(1 - P)}{n} \\
\mu_3(p) &= \frac{P(1 - P)(1 - 2P)}{n^2} \\
\mu_4(p) &= \frac{3P^2(1 - P)^2}{n^2} + \frac{P(1 - P)(1 - 6P + 6P^2)}{n^3}
\end{aligned}
\tag{20}
$$

Clearly $\mu_3(p)$ and $\mu_4(p)$ are both $o(1/n)$ as $n \to \infty$, and $g(P) = P(1 - P)$.
Consequently, if there is a transformation that will stabilize the variance,
the theorem implies that it will be

$$
f(p) = \int \frac{dp}{\sqrt{p(1 - p)}} = 2 \arcsin \sqrt{p} + C = \arccos (1 - 2p) + C \tag{21}
$$

* Curtiss, “On Transformations Used in the Analysis of Variance.”
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where C is an arbitrary constant, which may conveniently be taken as 0.
This yields the transformation $\varphi(p)$, defined in equation (4), $\varphi(p)$ being
expressed in radian measure. It is not difficult to show that the second
derivative of $\varphi(p)$ satisfies the requirements of the transformation
theorem for 0 < P < 1, whence

$$
V[\varphi(p)] = \frac{1}{n} + o\!\left(\frac{1}{n}\right) \quad \text{as } n \to \infty \tag{22}
$$
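
Equation (22) can be checked numerically without simulation by summing over the binomial distribution. The sketch below is illustrative only; the function names are hypothetical, and Bartlett's adjusted values are assumed for the end points x = 0 and x = n.

```python
from math import asin, comb, pi, sqrt


def angle(x: int, n: int) -> float:
    """phi = 2*arcsin(sqrt(x/n)) in radians, with Bartlett's adjustment at x = 0 and x = n."""
    if x == 0:
        return 2.0 * asin(sqrt(1.0 / (4.0 * n)))
    if x == n:
        return pi - 2.0 * asin(sqrt(1.0 / (4.0 * n)))
    return 2.0 * asin(sqrt(x / n))


def exact_variance(n: int, P: float) -> float:
    """Exact variance of the transformed proportion for a binomial sample of n items."""
    probs = [comb(n, x) * P**x * (1.0 - P) ** (n - x) for x in range(n + 1)]
    mean = sum(w * angle(x, n) for x, w in enumerate(probs))
    return sum(w * (angle(x, n) - mean) ** 2 for x, w in enumerate(probs))


if __name__ == "__main__":
    # n * V should stay near 1 over a fairly wide range of P about 0.5.
    for P in (0.1, 0.3, 0.5, 0.7, 0.9):
        print(P, round(20 * exact_variance(20, P), 3))
```

For n = 20 and P = 0.5 the result can be compared with the value 0.0528 given in Table 16.4.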


4.2.2. Mean of a Sample from a Poisson Exponential Population.— 

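Let $\theta = \mu$, the parameter (mean) of a Poisson exponential population,
and let $T = \bar{x}$, the mean of a random sample of n independent observations
from this population. It is well known that

$$
\begin{aligned}
E(\bar{x}) &= \mu \\
V(\bar{x}) &= \frac{\mu}{n} \\
\mu_3(\bar{x}) &= \frac{\mu}{n^2} \\
\mu_4(\bar{x}) &= \frac{3\mu^2}{n^2} + \frac{\mu}{n^3}
\end{aligned}
\tag{23}
$$

Clearly $\mu_3(\bar{x})$ and $\mu_4(\bar{x})$ are both $o(1/n)$ as $n \to \infty$, and $g(\mu) = \mu$.
Consequently, the theorem implies that, if there exists a transformation that
will stabilize the variance, it will be

$$
f(\bar{x}) = \int \frac{d\bar{x}}{\sqrt{\bar{x}}} = 2\sqrt{\bar{x}} + C \tag{24}
$$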


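It is convenient to take C = 0. Bartlett* chose to drop the 2 also
and recommended the transformation $y = \sqrt{\bar{x}}$. The first and second
derivatives of y with respect to $\bar{x}$ are

$$
y' = \frac{1}{2\sqrt{\bar{x}}}, \qquad y'' = -\frac{1}{4\bar{x}^{3/2}} \tag{25}
$$

and

$$
\begin{aligned}
E(y') &= \frac{1}{2\sqrt{\mu}} \quad \text{approximately} \\
E(y'') &= -\frac{1}{4\mu^{3/2}} \quad \text{approximately} \\
[E(y'')]^2 &= \frac{1}{16\mu^3} \quad \text{approximately}
\end{aligned}
\tag{26}
$$
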


* Bartlett, “Square Root Transformation.” 
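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Hence, equation (18) above becomes for $y = \sqrt{\bar{x}}$

$$
\begin{aligned}
V(y) &= \frac{1}{4n} + \frac{1}{64\mu^3}\left(\frac{3\mu^2}{n^2} + \frac{\mu}{n^3} - \frac{\mu^2}{n^2}\right) - \frac{1}{8\mu n^2} \quad \text{approximately} \\
&= \frac{1}{4n} - \frac{3}{32\mu n^2} + \frac{1}{64\mu^2 n^3}
\end{aligned}
\tag{27}
$$

Therefore, as $n \to \infty$ for fixed $\mu$

$$
V(y) = \frac{1}{4n} + o\!\left(\frac{1}{n}\right) \tag{28}
$$

and as $\mu \to \infty$ for fixed n

$$
V(y) = \frac{1}{4n} + O\!\left(\frac{1}{\mu}\right) \tag{29}
$$

Consequently, if either n or $\mu$ is large, $V(y) = 1/4n$, approximately.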
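
The leading term 1/4n can be checked by direct summation, using the fact that the sum of n independent Poisson observations with mean μ is itself Poisson with mean nμ. This is a sketch under that assumption; the function name and the truncation point `tail` are hypothetical choices, adequate only for moderate values of nμ.

```python
from math import exp, sqrt


def var_sqrt_mean(mu: float, n: int, tail: int = 2000) -> float:
    """Variance of sqrt(x-bar), where n * x-bar is Poisson with mean n * mu."""
    lam = n * mu
    weight = exp(-lam)  # probability that the total count is 0
    m1 = 0.0
    m2 = 0.0
    for k in range(tail):
        y = sqrt(k / n)
        m1 += weight * y
        m2 += weight * y * y
        weight *= lam / (k + 1)  # probability that the total count is k + 1
    return m2 - m1 * m1


if __name__ == "__main__":
    for mu, n in ((10.0, 1), (5.0, 4), (2.0, 25)):
        print(mu, n, round(4 * n * var_sqrt_mean(mu, n), 4))
```

The printed ratio 4nV(y) should be close to 1 whenever either n or μ is reasonably large.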

4.2.3. Estimate of the Variance of a Normal Population Based on n 
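Degrees of Freedom. —Let $\theta = \sigma^2$, the variance of a normal population,
and let $T = s^2$, an estimate of $\sigma^2$ such that $ns^2/\sigma^2$ is distributed as $\chi^2$
for n degrees of freedom. It is not difficult to show that

$$
\begin{aligned}
E(s^2) &= \sigma^2 \\
V(s^2) &= \frac{2\sigma^4}{n} \\
\mu_3(s^2) &= \frac{8\sigma^6}{n^2} \\
\mu_4(s^2) &= \frac{12(n + 4)\sigma^8}{n^3}
\end{aligned}
\tag{30}
$$

Clearly $\mu_3$ and $\mu_4$ are $o(1/n)$ as $n \to \infty$ and $g(\sigma^2) = 2(\sigma^2)^2$. Therefore, the
theorem implies that, if a transformation that will stabilize the variance
exists, it will be

$$
f(s^2) = \int \frac{d(s^2)}{\sqrt{2}\, s^2} = \frac{1}{\sqrt{2}} \log_e s^2 + C \tag{31}
$$

For convenience, we may take C = 0 and consider the transformation
$y = \log_e s^2 = 2.30259 \log_{10} s^2$. Since

$$
\begin{aligned}
E(y') &= \frac{1}{\sigma^2} \quad \text{approximately} \\
E(y'') &= -\frac{1}{\sigma^4} \quad \text{approximately} \\
[E(y'')]^2 &= \frac{1}{\sigma^8} \quad \text{approximately}
\end{aligned}
\tag{32}
$$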
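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the type of reasoning employed in connection with the preceding example
shows that

$$
V(\log_e s^2) = \frac{2}{n} + o\!\left(\frac{1}{n}\right)
\qquad
\begin{aligned}
&\text{as } n \to \infty \text{ for fixed } \sigma^2 &\quad (33) \\
&\text{as } \sigma^2 \to \infty \text{ for fixed } n &\quad (34)
\end{aligned}
$$

Consequently, $V(\log_e s^2) = 2/n$, approximately, provided either n or $\sigma^2$
is sufficiently large.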

4.2.4. Correlation Coefficient Computed from a Sample from a Normal 
Population. —Let $\theta = \rho$, the product moment coefficient of correlation
in a bivariate normal population, and let $T = r$, a correlation coefficient
computed from a sample of n observations drawn independently at
random from the population. The mean, variance, and higher moments
of r are exceedingly complicated functions of $\rho$ and n, but when n is
large and $|\rho| < 1$,

$$
\begin{aligned}
E(r) &= \rho \quad \text{approximately} \\
V(r) &= \frac{(1 - \rho^2)^2}{n} \quad \text{approximately} \\
\mu_3(r) &= o\!\left(\frac{1}{n}\right) \quad \text{as } n \to \infty \\
\mu_4(r) &= o\!\left(\frac{1}{n}\right) \quad \text{as } n \to \infty
\end{aligned}
\tag{35}
$$

Consequently, the theorem suggests the use of the following transformation
to stabilize the variance:

$$
f(r) = \int \frac{dr}{1 - r^2} = \frac{1}{2} \log_e \frac{1 + r}{1 - r} + C \tag{36}
$$

Taking C = 0, this yields

$$
z = \frac{1}{2} \log_e \frac{1 + r}{1 - r} = \tanh^{-1} r \tag{37}
$$

the transformation of r introduced by R. A. Fisher.*

4.3. A Method of Discovering Bartlett’s Adjustment 
Let the values of 

$$
\varphi(p) = 2 \arcsin \sqrt{p} \tag{38}
$$

* Fisher, “On the ‘Probable Error’ of a Coefficient of Correlation Deduced from
a Small Sample,” Metron, Vol. 1 (1921), pp. 1-32.
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where

$$
p = \frac{x}{n} \qquad (x = 0, 1, 2, \ldots, n) \tag{39}
$$

be plotted on the arithmetic scale of arithmetic-probability paper against
the cumulative probability

$$
\sum_{r=0}^{x} \binom{n}{r} P^r (1 - P)^{n-r} \tag{40}
$$

plotted on the probability scale, for a particular value of n and a selected
value of P. If n > 10 and 0.05 < P < 0.95, it will be found that the
plotted points for 2 < x < n − 2 lie very nearly on a straight line, that
the points for x = 1 and x = n − 1 depart slightly from the extension
of this line, but that the points for x = 0 and x = n lie far off the extension
of this line. If the values $\varphi(0) = 0$ and $\varphi(1) = \pi$ are replaced by
scores $\varphi(0) = a_n$ and $\varphi(1) = b_n$ such that the corresponding plotted
points fall on a gently curving extrapolation of the curve through the
plotted points for 2 < x < n − 2, it will be found that

$$
a_n = 2 \arcsin \sqrt{1/4n}, \qquad b_n = \pi - a_n \quad \text{approximately} \tag{41}
$$

The foregoing suggests that a better adjustment than Bartlett’s 
could be obtained by extending in both directions the line of best fit 
through the points for 2 < x < n — 2 and using this line to define the 
scores for x = 0, 1, n − 1, and n. This was tried and some improve¬
ment was noted, but for n > 10 and 0.05 < P < 0.95 the improvement 
did not appear to be worth the trouble. 

This procedure may be found useful for obtaining adjustments of the 
Bartlett type in connection with other transformations to stabilize variance. 

5. REFERENCES* 

5.1. Inverse Sine Transformation 

Bartlett, M. S.: “Square Root Transformation in Analysis of Variance,” Supplement
to the Journal of the Royal Statistical Society, Vol. 3 (1936), pp. 68-78.
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inverse sine transformation was put out by the Codex Book Company, Inc., Norwood, 
Mass. The paper was designed by Frederick Mosteller and John W. Tukey, pursuant 
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the Codex Book Company.

The paper may be ordered in two weights, heavy (no. 32,298) and light (no.
31,298). The sheets measure 8½ by 11 in.


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CHAPTER 17 

ELEMENTS OF SEQUENTIAL ANALYSIS* 


1. INTRODUCTION 

1.1. Nature of Sequential Tests 

A sequential test of a statistical hypothesis consists of making a
certain calculation after each observation (or group of observations) and
deciding on the basis of this calculation whether (1) to accept the hypothesis
under test, (2) to accept the alternative to that hypothesis, or (3) to
suspend judgment pending the examination of more data. The criteria
for these decisions are such that the hypothesis is accepted or rejected as
soon as, but not before, the data are sufficient for doing so with predetermined
degrees of reliability.

1.2. Advantages and Disadvantages 

Sequential is superior to nonsequential analysis whenever (1) the 
data become available serially and (2) the cost of the data (in terms of 
time, labor, material, and so forth) is more or less proportional to the 
amount of data. Nonsequential analysis is superior whenever (1) the 
amount of data is fixed or (2) the cost of the data is largely overhead, 
hence more or less independent of the amount of data. Superiority consists
in minimizing a set of quantities $(N, \alpha, \beta)$, where N is the number of
observations, $\alpha$ is the risk (that is, the probability) of erroneously
rejecting the hypothesis under test, and $\beta$ is the risk of erroneously
accepting this hypothesis. Sequential and nonsequential tests differ in
the constraints under which this set is minimized. Nonsequential tests
treat N as fixed and are so designed that with either risk $\alpha$ or $\beta$ fixed the
other is minimized. Sequential tests treat the number of observations N

* Sequential analysis was developed in the Statistical Research Group in 1943, 
the theory principally by A. Wald and the techniques of application by numerous 
members of the group, including Kenneth J. Arnold, H. A. Freeman, Milton Friedman, 
M. A. Girshick, Edward Paulson, A. Wald, and W. Allen Wallis. 

This chapter is an extension of a memorandum by the author of the chapter,
“Sequential Tests of Statistical Significance,” which is cited as “SRG 180” in Appendix
B of Sequential Analysis. A more extensive exposition of the theory than is given
here will be found in Appendix B of Sequential Analysis, and a more extensive summary
of methods of application than is given here will be found in “Sequential Tests.”
For a complete and rigorous account of the theory, see “Sequential Tests”; for a full
account of methods of application and interpretation, including computational aids,
see Sequential Analysis. (For full citations, see Sec. 8.)
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as a variable and are so designed that, for fixed risks $\alpha$ and $\beta$, the expected
(that is, average) number of trials $\bar{n}$ required to reach a decision is
minimized. If for a nonsequential test N is made large enough so that,
with $\alpha$ fixed, $\beta$ will not exceed a predetermined amount, this value of N
will exceed (frequently by as much as 100 percent) the $\bar{n}$ required with
a sequential test for the same $\alpha$ and $\beta$. Thus, when N is readily subject
to variation, sequential tests are superior; when N is not readily varied,
nonsequential tests are superior.*

1.3. General Procedure 

The basic quantity computed after each observation in a sequential
test is known in statistical literature as the “likelihood ratio.”
The probability of obtaining the observations actually at hand is computed
first on the assumption that the hypothesis under test (designated
the “null hypothesis”) is true, and second on the assumption that the
alternative to this hypothesis (the “alternative hypothesis”) is true. If
these probabilities are designated by $P_0$ and $P_1$, respectively, the likelihood
ratio is simply $P_1/P_0$. In a sequential test it is computed after each
observation (or group of observations), each computation being based
on all the observations available. If the likelihood ratio ever reaches a
certain level A (A > 1), the collection of data (experiment) halts and
the alternative hypothesis is accepted. If the likelihood ratio ever falls
to a certain level 1/B (B > 1), the experiment ceases and the null
hypothesis is accepted. When the likelihood ratio is between A and
1/B, the evidence is insufficient for either decision with the required
reliability. The required reliability, that is, the $\alpha$ and $\beta$ of the preceding
paragraph, completely determines A and B; in fact, $A = (1 - \beta)/\alpha$ and
$B = (1 - \alpha)/\beta$.

Since in most cases the successive observations are independent, the 
likelihood ratio for a set of observations is simply the likelihood ratio for 
the last observation multiplied by that for the set of preceding observa¬ 
tions. Consequently, using the logarithm of the likelihood ratio reduces 
the arithmetic to addition and subtraction. After each observation, 
simply add $\log P_1$ to and subtract $\log P_0$ from the algebraic sum of such
quantities for all preceding observations. If this sum ever reaches

$$
a = \log A = \log \frac{1 - \beta}{\alpha} \tag{1}
$$

accept the alternative hypothesis; if it ever falls to $-b$, where

* A detailed discussion of the conditions under which sequential analysis is 
advantageous and those under which it is disadvantageous is given in Statistical 
Research Group, Sampling Inspection , pp. 33-38 and 91-97. Although the discussion 
there is in terms of specific circumstances, its generalization is obvious. 
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
$$
b = \log B = \log \frac{1 - \alpha}{\beta} \tag{2}
$$

accept the null hypothesis. Otherwise, continue the experiment. 
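
The general procedure can be sketched in a few lines. This is an illustration only, assuming independent observations; the function name and the two caller-supplied functions `log_p1` and `log_p0` (the log probability of a single observation under the alternative and null hypotheses) are hypothetical.

```python
import math
from typing import Callable, Iterable, Tuple


def sequential_test(
    observations: Iterable[float],
    log_p1: Callable[[float], float],
    log_p0: Callable[[float], float],
    alpha: float,
    beta: float,
) -> Tuple[str, int]:
    """Cumulate log P1 - log P0 and stop at a = log A or at -b = -log B."""
    a = math.log((1.0 - beta) / alpha)
    b = math.log((1.0 - alpha) / beta)
    total = 0.0
    count = 0
    for count, x in enumerate(observations, start=1):
        total += log_p1(x) - log_p0(x)
        if total >= a:
            return "accept alternative", count
        if total <= -b:
            return "accept null", count
    return "continue", count  # data exhausted before either level was reached


if __name__ == "__main__":
    # Success/failure data with p0 = 0.1 under the null and p1 = 0.3 under the alternative.
    lp1 = lambda x: math.log(0.3 if x else 0.7)
    lp0 = lambda x: math.log(0.1 if x else 0.9)
    print(sequential_test([0, 1, 0, 1, 1, 1], lp1, lp0, 0.05, 0.05))
```

Natural logarithms are used throughout the sketch, so the termination levels and the individual terms are on the same base.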

1.4. Simplifications 

The preceding paragraph is applicable whatever the null and alterna¬ 
tive hypotheses may be, provided only that all parameters of each 
hypothesis are uniquely specified. In particular problems, the procedure 
can usually be simplified by expressing $\log P_1$ and $\log P_0$ directly in
terms of the observations, then adjusting the termination levels to simplify
the expression. This is illustrated below for three common tests: (1) for
the probability of success in a binomial distribution; (2) for the mean in
a normal distribution; and (3) for the difference in probability of success
between two binomial distributions. The arithmetic can be reduced in
the first and third cases to mere counting and in the second case to simple
addition. 


2. BINOMIAL DISTRIBUTION 
2.1. Algebraic Simplification of General Procedure 

Suppose it is required to determine whether the proportion p of successes,
from a certain type of operation that results in either success or
failure, is $p_0$. The alternative against which this is to be tested is that
$p = p_1$ ($p_1 > p_0$). The value of $p_0$ may be the minimum acceptable
proportion for the operation, it may be a value observed from long
experience with similar operations, or it may have been deduced from
theoretical reasoning. The value of $p_1$ may be the least excess over $p_0$
worth detecting, the proportion expected on theoretical grounds, and so
forth. Suppose that, after N operations, S successes have occurred.
The probability of this happening (in the sequence actually observed) is
$p_1^S (1 - p_1)^{N-S}$ under the alternative hypothesis and $p_0^S (1 - p_0)^{N-S}$
under the null hypothesis. The likelihood ratio is therefore

$$
\left(\frac{p_1}{p_0}\right)^{S} \left(\frac{1 - p_1}{1 - p_0}\right)^{N-S}
$$

and its logarithm is

$$
S \log \frac{p_1}{p_0} + (N - S) \log \frac{1 - p_1}{1 - p_0}
$$

The hypothesis that $p = p_0$ is to be rejected, and the hypothesis that
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$p = p_1$ accepted, if this is ever as great as a, that is, if S ever reaches

$$
S_1 = \frac{a + N \log \dfrac{1 - p_0}{1 - p_1}}{\log \left[\dfrac{p_1 (1 - p_0)}{p_0 (1 - p_1)}\right]} \tag{3}
$$


This is simply a linear function of N with positive intercept. Similarly, 
the hypothesis that $p = p_0$ is to be accepted if S is ever as small as

$$
S_0 = \frac{-b + N \log \dfrac{1 - p_0}{1 - p_1}}{\log \left[\dfrac{p_1 (1 - p_0)}{p_0 (1 - p_1)}\right]} \tag{4}
$$


another (and parallel) linear function of N with negative intercept. 


2.2. Graphical Procedure 


Thus, the simplest procedure is to prepare a chart with N on the 
horizontal and S on the vertical axis. On this chart the two parallel 
lines $S_1$ and $S_0$ are drawn. After each observation is obtained, the total
number of successes observed is plotted against the total number of
observations made. If, for any N, S is as great as $S_1$, discontinue the
experiment and accept the hypothesis that $p = p_1$. If, for any N, S is
as small as $S_0$, discontinue the experiment and accept the hypothesis
that $p = p_0$. When S is between $S_0$ and $S_1$, no decision is possible with
a risk as small as $\alpha$ or $\beta$. A variation of this procedure is to plot the
number of failures F (= N − S) on the horizontal axis instead of N,
retaining S for the vertical scale. Then the two decision lines become,
respectively,

$$
S_1' = \frac{a + F \log \dfrac{1 - p_0}{1 - p_1}}{\log \dfrac{p_1}{p_0}} \tag{5}
$$

and

$$
S_0' = \frac{-b + F \log \dfrac{1 - p_0}{1 - p_1}}{\log \dfrac{p_1}{p_0}} \tag{6}
$$

Caution: Be sure that the same logarithmic base is used for a and b
as is used elsewhere in $S_1$ and $S_0$ or $S_1'$ and $S_0'$.
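
As a rough illustration of equations (3) and (4), the two decision lines can be computed as a common slope and two intercepts and then applied to a running count. The function names and the numerical values in the example are hypothetical; natural logarithms are used consistently, in line with the caution above.

```python
import math
from typing import Tuple


def binomial_lines(p0: float, p1: float, alpha: float, beta: float) -> Tuple[float, float, float]:
    """Return (slope, upper intercept, lower intercept) of the lines S1 and S0."""
    a = math.log((1.0 - beta) / alpha)
    b = math.log((1.0 - alpha) / beta)
    g = math.log(p1 * (1.0 - p0) / (p0 * (1.0 - p1)))  # common denominator of (3) and (4)
    slope = math.log((1.0 - p0) / (1.0 - p1)) / g
    return slope, a / g, -b / g


def decide(N: int, S: int, p0: float, p1: float, alpha: float, beta: float) -> str:
    """Compare S successes in N trials with the two decision lines."""
    slope, upper, lower = binomial_lines(p0, p1, alpha, beta)
    if S >= upper + slope * N:
        return "accept p1"
    if S <= lower + slope * N:
        return "accept p0"
    return "continue"


if __name__ == "__main__":
    print(binomial_lines(0.1, 0.3, 0.05, 0.05))
    print(decide(11, 0, 0.1, 0.3, 0.05, 0.05), decide(12, 0, 0.1, 0.3, 0.05, 0.05))
```

The slope lies between $p_0$ and $p_1$, so a run of data with a success rate above $p_1$ drifts toward the upper line and one below $p_0$ drifts toward the lower line.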
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3. NORMAL DISTRIBUTION 

3.1. Algebraic Simplification of General Procedure 

Suppose the standard deviation of a normal (Gaussian) distribution
is known to be $\sigma$ and that it is required to test the null hypothesis that the
mean m is $m_0$ against the alternative hypothesis that $m = m_1$ ($m_1 > m_0$).
If N observations, $x_1, x_2, \ldots, x_N$, are obtained, the probability of this
sequence is

$$
\left(\sigma \sqrt{2\pi}\right)^{-N} \exp \left\{ -\frac{1}{2\sigma^2} \left[ \sum_{i=1}^{N} (x_i - m_0)^2 \right] \right\} \tag{7}
$$

under the null hypothesis and the same expression, except with $m_1$
replacing $m_0$, under the alternative hypothesis. From these probabilities
the natural logarithm of the likelihood ratio is found to be

$$
\frac{m_1 - m_0}{\sigma^2} \left( \sum_{i=1}^{N} x_i - \frac{m_0 + m_1}{2} N \right) \tag{8}
$$

The hypothesis that $m = m_1$ is to be accepted if this is ever as great as a,
that is, if $\Sigma x$ is ever as great as

$$
M_1 = \frac{a \sigma^2}{m_1 - m_0} + \frac{m_0 + m_1}{2} N \tag{9}
$$

Similarly, the hypothesis $m = m_0$ is to be accepted if $\Sigma x$ is ever as small
as

$$
M_0 = \frac{-b \sigma^2}{m_1 - m_0} + \frac{m_0 + m_1}{2} N \tag{10}
$$

3.2. Graphical Procedure 

As in the case of the binomial, prepare a chart with N on the horizontal
axis and $\Sigma x$ on the vertical axis, and draw on it the two parallel
straight lines $M_1$ and $M_0$, of which $M_1$ has a positive and $M_0$ a negative
intercept. As each observation is obtained, add it to the sum of all
previous observations and plot the total against N. If the plotted point
is ever as high as $M_1$, terminate the experiment and accept the hypothesis
that $m = m_1$. If it is ever as low as $M_0$, terminate the experiment and
accept the hypothesis that $m = m_0$. When it is between $M_0$ and $M_1$, no
decision can be reached without a risk of error exceeding $\alpha$ or $\beta$.

Caution: Be sure that a and b are natural logarithms. (Natural 
logarithms are 2.3 times the corresponding common logarithms.) 
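
The normal case can be sketched in the same way. The function names and example values below are hypothetical; a and b are computed as natural logarithms, as the caution requires.

```python
import math
from typing import Tuple


def normal_lines(m0: float, m1: float, sigma: float, alpha: float, beta: float) -> Tuple[float, float, float]:
    """Return (slope, upper intercept, lower intercept) of the lines M1 and M0."""
    a = math.log((1.0 - beta) / alpha)  # natural logarithms
    b = math.log((1.0 - alpha) / beta)
    slope = (m0 + m1) / 2.0
    return slope, a * sigma**2 / (m1 - m0), -b * sigma**2 / (m1 - m0)


def decide_normal(N: int, total: float, m0: float, m1: float, sigma: float,
                  alpha: float, beta: float) -> str:
    """Compare the cumulative sum of N observations with the two decision lines."""
    slope, upper, lower = normal_lines(m0, m1, sigma, alpha, beta)
    if total >= upper + slope * N:
        return "accept m1"
    if total <= lower + slope * N:
        return "accept m0"
    return "continue"


if __name__ == "__main__":
    running = 0.0
    for N, x in enumerate([1.9, 0.7, 2.2, 1.4], start=1):
        running += x
        print(N, round(running, 2), decide_normal(N, running, 0.0, 1.0, 1.0, 0.05, 0.05))
```

Only the cumulative sum and the count of observations are needed at each step, which is why the arithmetic reduces to simple addition.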
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4. DOUBLE DICHOTOMIES 


4.1. Null and Alternative Hypotheses 

Data from two different operations, each of which results in either 
success or failure, constitute a double dichotomy. To determine sequen¬ 
tially whether the proportions of successes are the same for the two 
operations (which is logically identical with testing whether the two 
operations are represented in the same proportion among the successes 
as among the failures), set up the null hypothesis that the ratio of suc¬ 
cesses to failures is the same for the two operations, and the alternative 
hypothesis that the ratio of successes to failures is u (u > 1) times as 
great for the second operation as for the first. Thus, if $p_1$ and $p_2$ are
the proportions of successes for the two operations, the null hypothesis
is that $\dfrac{p_2 (1 - p_1)}{p_1 (1 - p_2)} = 1$, which is equivalent to $p_1 = p_2$. The alternative
hypothesis is that $\dfrac{p_2 (1 - p_1)}{p_1 (1 - p_2)} = u$, which is equivalent to the relation
$p_2 = u p_1 / [1 + p_1 (u - 1)]$. Then collect observations in pairs, each pair
containing one observation for each operation. Those pairs in which both
operations succeed or both fail are disregarded, since they throw no light 
on the question of which operation yields the higher proportion of suc¬ 
cesses. Only the residual pairs, in which one and only one operation 
results in a success, are considered in the remainder of this section; these 
residual pairs will hereafter be called successes if the operation expected 
under the alternative hypothesis to be superior produces a success. 


4.2. Algebraic Simplification of General Procedure 

Under the null hypothesis, the proportion of such successes among the
residual pairs will be ½. Under the alternative hypothesis, the proportion
of such successes will be $u/(1 + u)$. Thus, the problem reduces
to that of the binomial distribution discussed in Sec. 2. If S is the
number of successes and if F is the number of failures from N residual
pairs (S + F = N), the logarithm of the likelihood ratio may be
written

$$
S \log \frac{2u}{1 + u} + F \log \frac{2}{1 + u} \tag{11}
$$

The alternative hypothesis (that the second operation is superior) is
to be accepted if this is ever as great as a; this will occur if, for any F, S
is as much as
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
$$
S_1 = \frac{a + F \log \dfrac{1 + u}{2}}{\log \dfrac{2u}{1 + u}} \tag{12}
$$


a linear function of F with positive intercept. Similarly, the null 
hypothesis (that the two operations are equally good) is to be accepted 
if the logarithm of the likelihood ratio is ever as small as $-b$, which will
occur if, for any F, S is as small as

$$
S_0 = \frac{-b + F \log \dfrac{1 + u}{2}}{\log \dfrac{2u}{1 + u}} \tag{13}
$$

a linear function of F, parallel to $S_1$ but with negative intercept.

4.3. Graphical Procedure 

Thus, the simplest procedure is to prepare a chart on which the hori¬ 
zontal axis represents F, the number of residual pairs in which a failure 
has occurred with the operation expected under the alternative hypothe¬ 
sis to be superior, and the vertical axis represents S , the number of residual 
pairs in which that operation has succeeded. Draw on this chart the 
two parallel straight lines So and Si. After each residual pair is observed, 
plot a point whose coordinates represent the total number of successes 
and failures among all residual pairs obtained so far. If, for any value 
of F , S is as great as Si, halt the experiment and conclude that the second 
operation is superior. If, for any value of F , S is as little as So, halt the 
experiment and conclude that the two operations are equally good. 
When S is between So and Si, no conclusion can be reached without a 
risk of error in excess of $\alpha$ or $\beta$. An alternative procedure is to let the
horizontal scale represent the number of residual pairs N (= S + F), in
which case the decision lines become, respectively,

$$
S_1' = \frac{a + N \log \dfrac{1 + u}{2}}{\log u} \tag{14}
$$

and

$$
S_0' = \frac{-b + N \log \dfrac{1 + u}{2}}{\log u} \tag{15}
$$

Caution: Be sure that the same logarithmic base is used for a and b
as is used elsewhere in $S_0$ and $S_1$ or $S_0'$ and $S_1'$.
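
A minimal sketch of the double-dichotomy procedure follows, assuming each pair is recorded as (result of first operation, result of second operation) with 1 for success and 0 for failure, and that the second operation is the one expected to be superior. The function names and data are hypothetical.

```python
import math
from typing import Iterable, Tuple


def tally_pairs(pairs: Iterable[Tuple[int, int]]) -> Tuple[int, int]:
    """Count residual pairs: S if only the second operation succeeds, F if only the first does."""
    S = F = 0
    for first, second in pairs:
        if second and not first:
            S += 1
        elif first and not second:
            F += 1
        # pairs in which both succeed or both fail are disregarded
    return S, F


def decide_pairs(S: int, F: int, u: float, alpha: float, beta: float) -> str:
    """Apply expression (11) with termination levels a and -b (natural logarithms)."""
    a = math.log((1.0 - beta) / alpha)
    b = math.log((1.0 - alpha) / beta)
    log_ratio = S * math.log(2.0 * u / (1.0 + u)) + F * math.log(2.0 / (1.0 + u))
    if log_ratio >= a:
        return "second operation superior"
    if log_ratio <= -b:
        return "operations equally good"
    return "continue"


if __name__ == "__main__":
    S, F = tally_pairs([(0, 1), (1, 1), (1, 0), (0, 0), (0, 1)])
    print(S, F, decide_pairs(S, F, 3.0, 0.05, 0.05))
```

Because the decision depends only on the two counts S and F, the arithmetic reduces to mere counting once the lines have been drawn.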
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5. OTHER APPLICATIONS 

5.1. Simple Problems 

Obviously, the general procedure and the simplifications described 
in Secs. 1.3 and 1.4 can be applied to a wide range of problems. The 
application is straightforward whenever the problem is simple, in that a
single parameter $\theta$ is under investigation and the purpose of the analysis
is to discriminate between a null hypothesis that this parameter lies
below [above] some level $\theta_0$ and an alternative hypothesis that it lies
above [below] another level $\theta_1$, where $\theta_1 > \theta_0$ [$\theta_1 < \theta_0$]. For example,
procedures for testing the mean of a Poisson distribution and for testing 
the standard deviation of a normal distribution are easily derived from 
the general procedure. More complicated problems can sometimes be 
reduced to a simple form for which the procedure is straightforward, as is 
illustrated by the case of double dichotomies. 

5.2. Complex Problems

Sequential procedures have also been worked out successfully for
certain more complicated problems. Important examples are (1) testing
the null hypothesis that the mean of a normal distribution has a
certain value $m_0$ against the two-sided alternative hypothesis that the
mean is either greater than $m_1$ or less than $2m_0 - m_1$ ($m_1 > m_0$); (2) testing
the same null and alternative hypotheses as in Sec. 3, but with no information
about the standard deviation other than that given by the sequential
sample (the sequential equivalent of Student's t test); (3) testing the
standard deviation of a normal distribution, with no information about
the mean other than that given by the sequential sample; (4) testing the
fraction of a normal distribution exceeding a certain level, with no
information about the mean or standard deviation other than that
given by the sequential sample.

6. OPERATING CHARACTERISTICS 

6.1. General 

Sequential tests are determined by four quantities, the values $\theta_0$
and $\theta_1$ specified by the null and alternative hypotheses for the parameter
being investigated and the values $\alpha$ and $\beta$ of the two risks of error. For
simple problems involving independent observations, for example, those
described in Secs. 2, 3, and 4, the sequential-test procedure involves only
three parameters, the slope s and the two intercepts $-h_1$ and $h_2$ of the
two decision lines. Obviously, then, many combinations of $\theta_0$, $\theta_1$, $\alpha$, and
$\beta$ lead to the same sequential test. By finding these combinations, we
can determine the probability that the test will lead to acceptance of
the null hypothesis for any possible value of $\theta$, not merely $\theta_0$ and $\theta_1$.
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The function (curve) giving the probability of acceptance Lb for every 
possible value of 0 is known as the operating-characteristic function 
(OC curve) of the test procedure; and 1 — Lb is known as the power 
function. 
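
For the binomial case, a complete OC curve can be traced with the well-known approximate parametric form due to Wald, in which an auxiliary quantity h indexes pairs (p, L_p). The sketch below assumes that form and the approximations A = (1 − β)/α and B = β/(1 − α); the function name is hypothetical, and h here is not the h_1 or h_2 of the decision lines.

```python
def binomial_oc_point(h, p0, p1, alpha, beta):
    """Return (p, L) on the approximate OC curve for auxiliary value h != 0."""
    if h == 0:
        raise ValueError("h = 0 is a limiting case; use p = s instead")
    A = (1 - beta) / alpha            # assumed upper critical likelihood ratio
    B = beta / (1 - alpha)            # assumed lower critical likelihood ratio
    up = (p1 / p0) ** h               # likelihood ratio of a success, to the power h
    down = ((1 - p1) / (1 - p0)) ** h # likelihood ratio of a failure, to the power h
    p = (1 - down) / (up - down)      # true proportion corresponding to h
    L = (A ** h - 1) / (A ** h - B ** h)  # probability of accepting the null hypothesis
    return p, L

# h = 1 reproduces (p0, 1 - alpha); h = -1 reproduces (p1, beta).
curve = [binomial_oc_point(h, 0.10, 0.30, 0.05, 0.10)
         for h in (-2, -1, -0.5, 0.5, 1, 2)]
```

Each value of h corresponds to one of the "combinations" mentioned above: a different pair of hypotheses and risks that leads to the same decision lines.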

6.2. Five-point OC Curves 

Although the equations of these curves are known, it ordinarily 
suffices to find five easily determined points on them. Two points are 
known from the fact that at the extreme values of θ, for example, at 
p = 0 and p = 1 in the case of the binomial distribution, the probabilities 
of acceptance are 1 and 0. Two more points are known from the fact 
that the test was obtained in such a way that at θ_0 and θ_1 the probabilities 
of acceptance are 1 − α and β. A fifth, and intermediate, point results 
from the fact that the probability of accepting the null hypothesis is 

$$\frac{h_2}{h_1 + h_2}$$

(which is 1/2 if α = β) when θ is such that the expected value of each term 
in the cumulated sum is s; for example, when the parameter of a binomial 
distribution is p = s, when the mean of a normal distribution is m = s, 
or when the standard deviation of a normal distribution is σ = √s. 
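
The five points can be tabulated directly. The following sketch does so for the binomial test, assuming the usual expressions for h_1, h_2, and s in terms of natural logarithms (h_1 = b/(g_1 + g_2), h_2 = a/(g_1 + g_2), s = g_2/(g_1 + g_2)); the names g1, g2, a, and b are introduced here for illustration only.

```python
import math

def five_point_oc(p0, p1, alpha, beta):
    """Return five (p, L_p) points on the OC curve of a binomial sequential test."""
    g1 = math.log(p1 / p0)              # log-likelihood ratio of a success
    g2 = math.log((1 - p0) / (1 - p1))  # minus the log-likelihood ratio of a failure
    a = math.log((1 - beta) / alpha)    # assumed Wald approximation
    b = math.log((1 - alpha) / beta)    # assumed Wald approximation
    h1, h2, s = b / (g1 + g2), a / (g1 + g2), g2 / (g1 + g2)
    return [
        (0.0, 1.0),             # no successes are possible: always accept
        (p0, 1 - alpha),        # null hypothesis true
        (s, h2 / (h1 + h2)),    # intermediate point, p = s
        (p1, beta),             # alternative hypothesis true
        (1.0, 0.0),             # every trial a success: always reject
    ]

points = five_point_oc(0.10, 0.30, alpha=0.05, beta=0.10)
```

When the two risks are equal the intermediate probability is exactly one-half, in agreement with the remark in the text.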

7. AVERAGE NUMBER OF OBSERVATIONS 

7.1. General 

The average number of observations necessary to reach a decision 
will depend upon the risks of error and upon the divergence between 
the null and alternative hypotheses. N̄ will be greater the smaller α, 
the smaller β, or the smaller the divergence between the null and alternative 
hypotheses. Furthermore, it will have one value, N̄_0, when the null 
hypothesis is true and another, N̄_1, when the alternative hypothesis is 
true; in general its value N̄_θ will depend upon the true value θ of the 
parameter being tested. The function (curve) giving the average 
number of observations for every possible value of θ is known as the 
average sample number function (ASN curve). Its equation may be 
written 

$$\bar{N}_\theta = \frac{(1 - L_\theta)\,a - L_\theta\,b}{\bar{l}_\theta} \tag{16}$$

where L_θ is the probability, when the true value of the parameter is θ, 
that the test will result in accepting the null hypothesis, and l̄_θ represents 
the average value of the logarithm of the likelihood ratio when the true 
value of the parameter is θ. In the case of the binomial distribution, 
for example, the likelihood ratio is either p_1/p_0 (when a success occurs), 
or (1 − p_1)/(1 − p_0) (when a failure occurs). If the true proportion of 
successes is p, 

$$\bar{l}_p = p \log\frac{p_1}{p_0} + (1 - p)\log\frac{1 - p_1}{1 - p_0} \tag{17}$$
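
A minimal numerical sketch of the ASN computation for the binomial case follows. It assumes that the numerator of the ASN expression is (1 − L)a − Lb with a = log[(1 − β)/α] and b = log[(1 − α)/β] (Wald's approximations, natural logarithms), and that the probability of acceptance L at the chosen p is already known; the function name is hypothetical.

```python
import math

def binomial_asn(p, L, p0, p1, alpha, beta):
    """Approximate average sample number when the true proportion is p."""
    a = math.log((1 - beta) / alpha)   # assumed upper critical value (log scale)
    b = math.log((1 - alpha) / beta)   # assumed lower critical value, sign reversed
    # Average log-likelihood ratio per observation at the true proportion p.
    l_bar = p * math.log(p1 / p0) + (1 - p) * math.log((1 - p1) / (1 - p0))
    if l_bar == 0:
        raise ValueError("average log-likelihood ratio is zero; use the p = s rule")
    return ((1 - L) * a - L * b) / l_bar

# At p0 the probability of acceptance is 1 - alpha; at p1 it is beta.
n0 = binomial_asn(0.10, 0.95, 0.10, 0.30, alpha=0.05, beta=0.05)
n1 = binomial_asn(0.30, 0.05, 0.10, 0.30, alpha=0.05, beta=0.05)
```

Both numerator and denominator change sign together as p passes from p_0 to p_1, so the quotient stays positive.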

7.2. Five-point ASN Curves 

Five points on the ASN curve are readily obtainable from the five 
known points on the OC curve. Two points are obtained from the 
extreme values of θ and the corresponding probabilities of acceptance, 
1 and 0. Two more points, corresponding with θ_0 and θ_1, are obtained 
by taking L_θ equal to 1 − α and β, respectively. The fifth, intermediate, 
point is obtained by multiplying together h_1 and h_2 and dividing the 
product by the variance of a term in the cumulated sum when θ is such 
that the expected value of each term is s; for example, the divisor of 
h_1 h_2 in testing the binomial proportion is s(1 − s), in testing the mean 
of a Poisson distribution it is s, in testing the mean of a normal distribution 
of known standard deviation σ it is σ², in testing the standard 
deviation of a normal distribution it is 2s². This fifth value, N̄_s, is usually 
(but not necessarily) near the maximum of the ASN curve. 
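
The five-point ASN curve for the binomial test can be sketched as follows. The two extreme points are obtained here by letting L equal 1 and 0 in the general ASN expression, which gives h_1/s at p = 0 and h_2/(1 − s) at p = 1; the helper names and the use of natural logarithms are assumptions made for illustration.

```python
import math

def five_point_asn(p0, p1, alpha, beta):
    """Return five (p, average sample number) points for a binomial sequential test."""
    g1 = math.log(p1 / p0)
    g2 = math.log((1 - p0) / (1 - p1))
    a = math.log((1 - beta) / alpha)   # assumed Wald approximation
    b = math.log((1 - alpha) / beta)   # assumed Wald approximation
    h1, h2, s = b / (g1 + g2), a / (g1 + g2), g2 / (g1 + g2)

    def asn(p, L):
        l_bar = p * g1 - (1 - p) * g2  # average log-likelihood ratio at p
        return ((1 - L) * a - L * b) / l_bar

    return [
        (0.0, h1 / s),                   # all failures: acceptance is certain
        (p0, asn(p0, 1 - alpha)),        # null hypothesis true
        (s, h1 * h2 / (s * (1 - s))),    # product of intercepts over the variance
        (p1, asn(p1, beta)),             # alternative hypothesis true
        (1.0, h2 / (1 - s)),             # all successes: rejection is certain
    ]

asn_points = five_point_asn(0.10, 0.30, alpha=0.05, beta=0.05)
```

In this example the intermediate value is the largest of the five, which is the usual but not the inevitable situation.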

For a given value of θ, sampling fluctuations will cause N to vary 
from one sequential analysis to another, using the same values of h_1, h_2, 
and s. The distribution of N is skewed in such a way that most samples 
require fewer than N̄_θ observations; the variability is such that the 
sample size rarely exceeds 3N̄_θ. For usual combinations of θ_0, θ_1, α, and 
β, the ASN curve falls off sharply from its peak and N̄_0 and N̄_1 are 
substantially less than N̄_s; this is not necessarily the case, however, 
and in fact it is possible for the maximum of the ASN curve to occur 
outside the interval θ_0 to θ_1, although θ = s is always between θ_0 and θ_1. 
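
The skewness of the distribution of N can be examined by simulation. The sketch below repeats a binomial sequential test many times at a fixed true proportion and summarizes the resulting sample sizes; the decision lines use the same assumed expressions for h_1, h_2, and s as above, and the function name, seed, and number of runs are arbitrary choices for illustration.

```python
import math
import random

def simulate_sample_sizes(p, p0, p1, alpha, beta, runs, seed=0):
    """Simulate the number of observations needed by a binomial sequential test."""
    g1 = math.log(p1 / p0)
    g2 = math.log((1 - p0) / (1 - p1))
    h1 = math.log((1 - alpha) / beta) / (g1 + g2)
    h2 = math.log((1 - beta) / alpha) / (g1 + g2)
    s = g2 / (g1 + g2)
    rng = random.Random(seed)
    sizes = []
    for _ in range(runs):
        n = successes = 0
        while True:
            n += 1
            successes += rng.random() < p   # one Bernoulli trial
            # Stop as soon as either decision line is reached.
            if successes <= -h1 + s * n or successes >= h2 + s * n:
                sizes.append(n)
                break
    return sizes

sizes = simulate_sample_sizes(0.10, 0.10, 0.30, 0.05, 0.05, runs=4000)
mean_n = sum(sizes) / len(sizes)
share_below_mean = sum(n < mean_n for n in sizes) / len(sizes)
share_above_3x = sum(n > 3 * mean_n for n in sizes) / len(sizes)
```

The simulated mean may somewhat exceed the approximate ASN, because the cumulated sum generally overshoots a decision line slightly before sampling stops.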
237-244 
plan, 26 
plan, 26-28 
curtailment of, 14 

efficiency of, compared with sequential 
efficiency of, compared with sequential 
variables plan, 26n., 85 
efficiency of, compared with single-sample 
attributes plan, 26-28 
definition of, 14 
plan of action for, 14, 16-21 
limit, 227-234 

formulation in terms of σ, 229-231 
multivariate, 50-52, 156, 157 
single "score," 51, 113, 114 

based on generalised Student’s T, 114-110 
simultaneous use of several sample criteria, 
for standard deviation of quality, sequential 
sampling, 232-234 

single-sample, 231, 232, 270, 271, 278-280 
upper and lower limits on single quality char¬ 
acteristic, 52—56, 63, 64 
when lot quality is measured by fraction of 
sample ranges that are less than specified 
limit, 227-234 

sequential sampling, 232, 234 
Acceptance line, 232, 356, 357, 359, 360, 422, 423, 
425 

adjusting rejection line to take account of, 
358-360 

figure showing, 357 

for number of "success-failure" pairs, in 
double-dichotomy test, 425 

for number of "successes" in binomial test, 422 
for sum of observations, in test for μ, σ known, 423 
for sum of squared deviations, in test for σ, μ 
unknown, 232 
of unit slope, 356, 357, 360 

(See also Acceptance criterion; Acceptance 
Acceptance number (A_k or S_0), 237, 239, 244, 
422, 425 

constant, in sequential analysis, 356 

adjusting rejection line to allow for, 358, 359 
example of, 359-361 
definition of (A_k), 237 
in double sampling, 237 
examples of, 239, 244 
infinity, 358 

in multiple sampling, 237 
examples of, 239, 244 
omitted, 358 

relation to rejection number characterises 
type of sampling plan, 237 
in sequential sampling (A_k), 237 
in sequential tests (S_0), 422, 425 
in single sampling, 237, 260n., 263, 264 
Acceptance region, 38-41, 52-57, 67-71, 232, 233, 
355-361, 377, 422, 423, 425 
boundaries of, for sequential binomial test, 422 
for sequential double-dichotomy test, 425 
for sequential mean test, σ known, 423 
for sequential procedure for determining 
whether to complete or abandon an 
experiment, 356-361 
figure showing, 357 

for sequential standard deviation test, μ 
unknown, 232-233 

for single-sample variables inspection plans 
for fraction defective, 35-37, 41, 53-57, 
67-71 

figures showing, 38-40, 55 
for "immediate acceptance," 34, 37, 41, 67, 
69-70 

determined, by constant acceptance number, 
356-361 

figure showing, 357 

by pair of one-sided criteria based on x̄ 
and s, 52-54 
figure showing, 55 

by sequential-sampling procedures, 232- 
233, 422, 423, 425 

by sign of difference between sample means, 
377 

by single one-sided criterion based on x̄ and s, 
36-41, 67-71 
figures showing, 38-40 

by single two-sided criterion based on x̄ 
and s, 52, 54-56, 57 
figures showing, 38-40, 55, 357 
for "immediate acceptance,” in variables 
inspection for fraction defective, 34, 37, 
41, 67, 69-70 

Addition theorem, for χ², 163 
Adjusted data, use of, in variables inspection for 
fraction defective, 36-41, 54, 69-71 
Adjusted value, of chi (χ_c), 254, 255, 257 
general procedure for obtaining, 254n. 
of chi-squared (χ_c²), 254, 255 
general procedure for obtaining, reference 
giving, 254n. 

of phi (φ_B), formula for, 401 
table of, 406 
(See also Adjustment) 
Adjustment, Bartlett’s, 390 

heuristic derivation of, 413-414 
table to facilitate, 406 

of chi (χ), when calculated from 2 × 2 table, 
254, 255, 257 

general procedure for, 254n. 
of chi-squared, when calculated from 2 × 2 
table, 254-255 

general procedure for, reference giving, 254n. 
constant (A), in graphical procedure for 
variables inspection for fraction defective, 
36, 37, 69-71 

optimum choice of, 41, 54, 70, 71 
of inverse-sine transformation, 399, 405-407, 
407n. 

heuristic derivation of Bartlett’s, 413, 414 
table to facilitate Bartlett’s, 406 
of k, when N is fixed, 19, 20 
of N, by modifying OC curve, 19 
of rejection line, to allow for fixed acceptance 
line, 358, 359 
example, 359-361 
of s, to eliminate bias, 58 

of starting point, after first two rounds of 
sequential procedure for locating a maximum, 
368, 369 
Yates’, 254, 255, 257 
Air testing of bombsights, 113-184 

arrangement of bombings for, 121, 122 
function of, 117 

multivariate quality control of, 113-184 
problems of, 116 

proposed sampling plan for, 119-121 
α: probability of erroneously rejecting hypothesis 
under test, 271, 282, 283, 419 
taken equal to 0.50, 283 

use in determining sequential test procedures, 
420, 421 

(See also Error, of the first kind) 

α: probability that lot of acceptable quality will 
be rejected, 16 
(See also Producer’s risk) 
α: true intercept of linear regression equations, 
322 

estimation of, when cost of each measurement 
depends upon its magnitude, 322-328 
α-level of significance, 270, 278-279, 281 
Alternative hypothesis, in analysis of variance, 
311-313 

for which probability of error of second kind 
is β, 61, 231, 260n., 263, 264, 271, 279, 313, 
380, 421, 423, 424 

in tests of means, 61, 377, 379-380, 386, 423 
of proportions (percentages), 260n., 263, 264, 
357-358, 421, 424 

of standard deviations (variances), 231, 270, 
271, 278, 281, 311-313, 387 
American Society for Testing Materials, 58n., 93 
American Standards Association, 58n., 93 
Amount of information, about dependence of 
one variable on another, 321 
maximising, when cost of each measurement 
depends upon its magnitude, 319-328 


Amount of information, about a function in 
neighborhood of maximum, 366 
provided by factorial experiment, 366, 367, 
370, 371 

provided by sequential experiment, 368, 369, 
370, 371 

about population mean (μ), provided by probit 
data, 345, 346 
maximizing, 349-352 
provided by sample mean (x̄), 161 
provided by sample median, 161 
about population standard deviation (σ), 
provided by probit data, 345, 346 
maximizing of, 349-352 
provided by sample mean deviation, 162, 163 
provided by sample standard deviation (s), 
162, 229 

about quality of lot, provided by sampling 
inspection, 156-158 
when inspection is by attributes, 7 
when inspection is by variables, 7 
(See also Amount of inspection) 

Amount of inspection, 14, 15, 17-21, 26-28, 
60-66, 81, 82, 85, 100-109, 231, 233, 234, 
235-244, 259, 260, 263, 264, 274, 275, 278, 
280, 282, 283, 311-314, 319-328, 331-338, 
358, 362n., 380, 381, 419, 420, 427, 428 
with attributes plans, 14, 15, 26-28, 237-244, 
259, 260n., 263, 264, 427, 428 
comparison of double, multiple, and se¬ 
quential sampling, 237-244 
figures showing, 240-243 
contrasted with variables plans, 26-28 
formula for determining, with sequential 
sampling, 427, 428 

for single sampling, 259, 260n., 263, 264 
nomogram for determining, for single 
sampling, 260 
method of using, 260n. 
when severity of any one test cannot be 
controlled precisely, 331-338 
average, 14, 15, 26, 85, 233, 234, 235-244, 358, 
362n., 419, 420, 427, 428 
for comparison of two proportions, by 
sequential sampling, 424, 427, 428 
with curtailed (truncated) sampling, se¬ 
quential, 362n. 
single, 14, 15 

for double dichotomies, by sequential 
sampling, 424, 427, 428 
with double sampling, compared to multiple 
and sequential, 237-244 
figures showing, 240, 242 
with multiple sampling, compared to double 
and sequential, 237-244 
figures showing, 240, 242 
with sequential analysis, contrasted with 
non-sequential sampling, 26, 85, 235- 
244, 419, 420 

general formula for one-parameter tests, 
427 

with sequential attributes plans, 26, 235-244, 
427, 428 

when acceptance line is omitted, 358 
Amount of inspection, average, with sequential 
attributes plans, contrasted with 
double- and multiple- sampling 
attributes plans, 237-244 
figures showing, 240, 242 
contrasted with single-sample attributes 
plans, 26 

formula for, 427, 428 

with sequential one-decision binomial test, 
358, 427, 428 

with sequential one-sided binomial test, 
formulas for, 427, 428 
figures showing, 240, 242 
with sequential one-sided test for σ, when μ 
is unknown, formula for, 233 
figures showing, 234 

with sequential variables inspection plan 
for fraction defective, 85 
(See also ASN curve) 

when cost of each measurement depends upon 
its magnitude, 319-328 

in curtailed (truncated) inspection, 14, 15, 362n. 
when decisions are based on signs of differences 
between sample means, formula for, 380 
determination of, nomogram for, 260 

procedures for, 17-21, 27n., 28n., 64-66, 
81, 82, 231, 274, 275, 278, 280, 282, 283, 
311-314, 331-338, 362n., 427, 428 
for analysis-of-variance tests, 311-314 
for χ²-tests of variance estimates, 231, 
274, 275, 278 

for curtailed (truncated) sequential sam¬ 
pling, 362n. 

for F-tests of variance estimates, 280, 282, 
283, 296, 311-314 

for one-decision sequential binomial test, 
358, 427, 428 

when severity of any one test cannot be 
controlled precisely, 331-338 
for standard deviation tests, 274, 275, 
278, 280, 282, 283 

for variables inspection plans for fraction 
defective, 17-21, 27n., 28n., 64-66, 81-82 
differences in, attributes versus variables 
inspection, 26-28 

with double, multiple, and sequential 
sampling, 237-244 
figures showing, 240-243 
sequential versus non-sequential sampling, 
26, 85, 237-244, 419, 420 
figures showing, 233, 240-243 
distribution of, in repeated sequential sampling, 
428 

with double sampling, compared to multiple 
and sequential, 237-244 
figures showing, 240-243 
figures showing dependence on quality sub¬ 
mitted, 234, 240-243 
fixed in advance, adjustment of k, 19, 20 
formulas for, 17, 27n., 28n., 60-63, 231, 233, 
259, 260, 263-264, 278n., 427, 428 
in attributes inspection, with sequential 
sampling, 427, 428 
with single sampling, 259, 260, 263, 264 


Amount of inspection, formulas for, for binomial 
test, one-sided, with sequential sam¬ 
pling, 427, 428 

with single sampling, 259, 260n., 263, 264 
when decisions are based on signs of differ¬ 
ences between sample means, 380 
for difference of means test, single sample, 
when σ is known, 61, 62 
when σ is unknown, 17, 27n., 28n., 60-66, 
81, 82 

for difference of proportions (percentages) 
test, sequential sampling, 424, 427, 428 
single sample, 259, 260, 263, 264 
incorrect formula for, 258 
for double-dichotomy homogeneity test, 259, 
260, 263, 264 

for fraction defective test based on variables, 
single sample, 17 

when μ is known and σ unknown, 27n., 
28n., 231, 278n. 

when μ is unknown and σ known, 27n., 
28n., 62 

when μ and σ are both unknown, 17, 27n., 
28n., 60-63 

for general one-parameter tests, sequential 
sampling, 427 
single sampling, 61, 62 
for mean test, single sample, with σ known, 
27n., 28n., 62 

with σ unknown, 17, 27n., 28n., 60-66, 
81, 82 

for proportion test, with sequential sampling, 
427, 428 

with single sample, 259, 260n., 263, 264 
when range of samples of k determines lot 
quality, with sequential sampling, 233 
with single sample, 231, 278n. 
for setting tolerance limits, 101, 108, 109 
for standard deviation test, with sequential 
sampling, 233 

with single sample, 231, 278n. 
for variables inspection plans for fraction 
defective, with single sample, 17 
when μ is known and σ is unknown, 27n., 
28n., 231, 278n. 

when μ is unknown and σ is known, 27n., 
28n., 62 

when μ and σ are both unknown, 17, 27n., 
28n., 60-63 

minimum acceptable, when severity of any 
one test cannot be controlled precisely, 
331-338 

minimum proposed, when data are coarsely 
grouped or rounded, 213, 214 
with multiple sampling, compared to double 
and sequential, 237-244 
figures showing, 240-243 
nomogram for determining, 260 
uses of, and procedures, 260 
normal, 45 
reduced, 45 

reduction in, by efficient utilisation of data, 
160-163 

by knowing μ or σ, 27, 28 
Amount of inspection, reduction in, by using 
double sampling, 237-244 
figures showing, 240, 242 
by using efficient formulas, 160-163 
by using multiple sampling, 237-244 
figures showing, 240, 242 
by using sequential sampling, 26-28, 85, 233, 
237-244, 369-371, 419, 420 
figures showing, 233, 240, 242 
by using variables inspection, 26-28 
with sequential sampling, 26, 26n., 85, 233-244, 
362n., 419, 420, 427, 428 
when acceptance line is omitted, 358, 358n. 
in attributes inspection, 26, 235-244, 427, 428 
contrasted with double and multiple 
sampling, 237-244 
figures showing, 240-243 
contrasted with single sampling, 26 
contrasted with non-sequential sampling, 
26-28, 85, 235-244, 419, 420 
with curtailed sampling, 362n. 
formula for, general one-parameter tests, 427 
one-sided binomial test, 427, 428 
one-sided test for σ, when μ is unknown, 233 
procedures for determining, curtailed sam¬ 
pling, 362n. 
general, 427 

one-decision binomial test, 358, 427, 428 
in variables inspection for fraction defective, 
85 

for setting tolerance limits, 100-108 
with single sample, 14, 15, 17-21, 26-28, 60-66, 
81, 82, 85, 231, 259, 260, 263, 264, 274, 275, 
278, 280, 282, 283, 311-314, 331-338, 380, 
381 

analysis-of-variance tests, 311-314 
attributes plans, 14, 15, 26-28, 259, 260, 
263, 264 

contrasted with sequential sampling, 26, 
419, 420 

contrasted with variables plans for 
fraction defective, 26-28 
binomial tests, one-sided, 259, 260, 260n., 
263, 264 

χ²-tests of standard deviation (or variance), 
231, 274, 275, 278 

contrasted with sequential sampling, 26, 85, 
419, 420 
curtailed, 14, 15 

when decisions are based on signs of differences 
between sample means, 380 
difference of means test, when σ is known, 
61, 62 

difference of proportions (or percentages) 
test, formula, 259, 260, 263, 264 
nomogram for, 260 

double-dichotomy homogeneity tests, 259, 
260, 263, 264 

for F-tests of standard deviations (or 
variances), 280, 282, 283, 311-314 
general formula for one-parameter tests, 61- 
62 

for mean test, σ known, 27, 28, 62 

σ unknown, 17, 27n., 28n., 60-66, 81, 82 


Amount of inspection, with single sample, for 
proportion test, 259, 260n., 263, 264 
when range in samples of k determines lot 
quality, 231, 278n. 

for standard deviation tests, 231, 274, 275, 
278, 280, 282, 283 

for variables inspection plans for fraction 
defective, 17-28, 60-66, 81, 82 
when μ is known and σ unknown, 27n., 
28n., 231, 274, 275, 278 
when μ is unknown and σ is known, 27, 28, 
62 

when μ and σ are both unknown, 17-21, 
26-28, 60-66, 81, 82 
standard deviation of, 238, 241, 243 

computational methods for determining, 244 
figures showing, 241, 243 
tables showing, for analysis-of-variance tests, 
313n. 

for attributes plans, 26 
for setting tolerance limits, 102-107 
for standard deviation tests, 272, 273, 276, 
277, 284-295, 298-309 

for variables inspection for fraction defective, 
22-27, 65 

with truncated sampling, sequential, 362n. 

single, 14, 15, 238 
variability of, 235-244 

computational methods for determining, 244 
figures showing, 241, 243 

nature of, in repeated sequential sampling, 428 
Analysis of consumption requirements, 47-49 
Analysis of experimental error (see Analysis of 
variance) 

Analysis of production possibilities, 46, 47 
Analysis of variance, 297, 310-314, 385, 386 
algebraic expression for coefficient of between- 
group variance, 312 
for degrees of freedom in, 312 
for mean squares in, 312 
expected values of, 312 
for sums of squares in, 312 
of binomial data, 398, 407, 408 
components of random variation in, 297, 310, 311 
resolution by, 311-314 
components of systematic variation, 310, 311 
components of variance, resolution by, 311-314 
of percentage data, 398, 407-408 
planning experiments leading to, 297, 310-314 
skeleton, 312 

systematic versus random variation in, 297, 
310-311 
table, 312 

tests of significance in, 312-314, 385, 386 
for components of variance, 312-314 
for differences among subgroup means, 385, 
386 

(See also Standard deviation; Variance) 
Angular difference, between two binomial prob¬ 
abilities, 262, 264 

between two observed proportions, 264 
Angular transformation, 257-259, 261-264, 336, 
395-416 

(See also Inverse-sine transformation) 
Animal experiments, 311, 341, 342, 343n., 352 
Applied Mathematics Panel, iii, vii, x 
Summary Technical Report of, vii 
Approximations, involved in formulas for vari¬ 
ables inspection for fraction defective, 21, 
64-66 

effects of, 21, 49, 50, 62-64 
table showing, 65 

use of non-central t to avoid, 64-66 
AQL (see Acceptable-quality level) 
arcsin x, reference to table of, 260n., 265, 402n., 416 
Arcsine transformation, 257-259, 261-264, 336, 
395-416 

(See also Inverse-sine transformation) 
Arithmetic mean, of sample measurements, 161 
as estimator of population (lot) mean, 12, 161, 
162, 192, 196-198, 200, 203, 212, 213 
formula for, 12, 100, 101, 161, 192 

when measurements are grouped (or 
rounded), 196 

symbol for (x̄), 12, 161, 192 

when measurements are grouped or rounded 
(3>r), 196 

(See also Sample mean) 

Arithmetic-probability paper, 414 
Army, viii 

Arnold, Kenneth J., ix, x, xii, 6, 354, 419 
ASN (see Average sample number) 

Assignable causes, 269 

that introduce random components of varia¬ 
tion, 297, 310, 311 

systematic components of variation, 310, 311 
Asymptotic expansion, for confidence limits for 
population proportion, 336 
for percentage points of ^-distribution, 148 
for percentage points of ^-distribution, 155 
for tolerance-limit factor K, 101 
Attributes, inspection by, 7-9, 14, 15, 26-28, 
229n., 237-244, 259, 260, 263, 264, 331-338, 
358-362, 419-422, 427, 428 
advantages and disadvantages of, 7-9 
amount of inspection, 14, 15, 26-28, 237-244, 
259, 260n., 263, 264, 427, 428 
comparison of double, multiple and sequen¬ 
tial sampling, 237-244 
figures showing, 240-243 
comparison of sequential and single sampling, 
26, 419, 420 

comparison with variables inspection, 26-28 
formula for determining, with sequential 
sampling, 427, 428 

with single sample, 259, 260n., 263, 264 
nomogram for determining, with single 
sample, 260 

procedure for using, 260n. 
when severity of any one test cannot be 
controlled precisely, 331-338 
(See also Amount of inspection) 
to avoid normality assumption, 8, 229n. 
as basis for deciding whether to abandon or to 
complete an experiment, 355-362 
contrasted with inspection by variables, 7-9, 
26-28 


Attributes, curtailment of, 14, 15, 238 
with double sampling, 237-244 
amount of inspection, average, 240, 242, 244 
variability of, 238, 241, 243, 244 
characterized by relation of Rk to Ak, 237 
computational procedures for, references to, 
244 

contrasted with multiple and sequential 
sampling, 237-244 

formulas, for acceptance line, with sequential 
sampling, 422 

for amount of inspection, with sequential 
sampling, 427, 428 

with single sampling, 259, 260n., 263, 264 
for ASN curve, with sequential sampling, 
427, 428 

for five-point OC curve, for sequential 
sampling, 427 

for OC curve, for single sampling, 260n., 
261-263 

for rejection line, with sequential sampling, 
422 

for rejection number (proportion), with 
single sample, 260n., 263, 264 
when inspection is costly, 9 
when inspection is destructive, 9 
with multiple sampling, 237-244 

amount of inspection, average, 240, 242, 244 
variability of, 238, 241, 243, 244 
characterized by relation of Rk to Ak, 237 
computational procedures for, references to, 
244 

contrasted with double and sequential 
sampling, 237-244 

nomogram, for determining amount of inspec¬ 
tion and OC curve for single sampling, 260 
procedure for using, 260n., 263 
OC curve, for sequential sampling, procedure 
for five-point, 427 

single sampling, formula for, 260n., 263, 264 
nomogram for, 260n., 263 
when range in samples of k determines lot 
quality, 229n. 
reasons for use, 7-9 

selection of AQL in, discussion of, 45-47 
with sequential sampling, 26, 235-244, 357- 
362, 419, 420, 421, 422, 427, 428 
acceptance line, formula for, 422 
when acceptance line is fixed in advance, 
358-362 

OC curve, procedure for, 361, 362 
when acceptance line is omitted, 357, 358 
OC curve, formula for, 358 
amount of inspection, average, 26, 240, 242, 
244, 419, 420, 427, 428 
variability of, 238, 241, 243, 244 
as basis for deciding whether to abandon or 
to complete an experiment, 357-362 
characterised by relation of Rk to Ak, 237 
computational procedures for curtailed (truncated) 
sampling, references to, 244, 
361n., 362n. 
steps of, 361, 362 
Attributes, with sequential sampling, contrasted 
with double- and multiple-sampling attributes 
inspection, 237-244 
contrasted with sequential variables inspection, 
26n., 85 

single-sampling, attributes inspection, 26 
curtailed (truncated), 361, 362 
OC curve, when acceptance line is fixed in 
advance, procedure for, 361, 362 
when acceptance line is omitted, formula 
for, 358 

general five-point, formulas for, 427 
when sampling is curtailed (truncated), 
steps in determining, 361, 362 
rejection line, formula for, 422 
when severity of any one test cannot be con¬ 
trolled precisely, 331-338 
with single sample, 14, 15, 26-28, 237, 238, 
259, 260n., 263, 264 

acceptance criterion for, 260n., 263, 264 
amount of inspection for, 14, 15, 26-28 
contrasted with sequential-sampling at¬ 
tributes inspection, 26-28 
contrasted with single-sample variables 
inspection, 26-28 
formula for, 259, 260n., 263, 264 
nomogram for, 260 

procedure for using, 260n. 
characterized by relation of Rk to Ak, 237 
contrasted with double, and multiple 
sampling, 237 

with sequential attributes inspection, 26 
with single-sample variables inspection, 
26-28 

curtailed, 14, 15, 238 
OC curve, formula for, 260n., 263, 264 
nomograph for, 260n., 263 
rejection criterion, 260n., 263, 264 
when tests are destructive, 9 
Authorship, ix 

Average (see Arithmetic mean; Mean; Sample 
mean) 

Average error, 162 

(See also Mean deviation) 

Average sample number (ASN, N̄), 238, 420 
(See also ASN curve; Sample size, average) 
ASN curve, 233-234, 238, 240, 242, 244, 362, 427- 
428 

comparison of, for double-, multiple-, and 
sequential-sampling attributes plans, 237- 
244 

figures showing, 240, 242 
computation of, references to general methods 
of, 244, 362n. 

definition of, for general one-parameter tests, 427 
special cases, 233, 238 

for double-sampling attributes plans, figures 
showing, 240, 242 
figures showing, 234, 240, 242 
five point, for sequential one-parameter 
tests, 428 

formulas, for general one-parameter sequential 
test, 427 


ASN curve, formulas, for sequential one-sided 
binomial test, 427, 428 
with acceptance line omitted, 358 
for sequential one-sided test of σ, 233 
maximum of, position of, 428 
for multiple-sampling attributes plans, figures 
showing, 240, 242 

for sequential attributes plans, figures showing, 
240, 242 

formula for, 427, 428 

for sequential one-sided binomial test, figures 
showing, 240, 242 
formula for, 427, 428 

for sequential one-sided test of σ, figure showing, 
234 

formula for, 233 

Average sample size (see ASN curve; Sample size, 
average) 

B 

B: critical value of likelihood ratio, in sequential 
analysis, 420 

B : function appearing in formulas for sequential 
variables inspection, 84 

b: critical value of minus λ (−λ) in sequential 
analysis, 84, 421 

b: function of N, K a , and Ki that appears in 
formula for k when N is fixed, 19, 59, 61 
formulas for, 20, 59, 61 

b: multiple of σ/√(2(N − 1)) in formula for 
standard deviation of s, 58 
formula for, 58 
reference to table of, 58n. 

b: ratio of standard deviation of s to 
σ/√(2(N − 1)) in normal samples, 58 
Baines, A. H. J., 281n., 297, 313, 314, 318 
Barlow's tables, 34, 93 

Bartlett, M. S., 183, 385, 386, 387, 393, 398, 399, 
400, 401, 403-407, 411, 413-415 
Bartlett’s adjustment, of inverse-sine transforma¬ 
tion, 399 

heuristic derivation of, 413, 414 
table to facilitate, 406 

Bartlett’s test of homogeneity of variance esti¬ 
mates, 183, 387, 388 

compared to F-test for homogeneity of sample 
means, 386 

as "omnibus test," 385, 386 
optimum properties of, 386 
separation into parts, assignable to different 
factors, 386 

tables to facilitate application of, reference to, 
388n. 

Baten, W. D., 193n., 218n., 222 
Bennett, Rollin F., ix 

β: probability of erroneously accepting hypothesis 
under test, 271, 419 

taken equal to 0.50, 20, 21, 271, 274, 282, 283, 
313, 314 

use in determining sequential test procedures, 
420, 421 

(See also Error of the second kind) 
β: probability that lot of unacceptable quality 
will be accepted, 16 

(See also Consumer’s risk) 

β: true slope of a linear regression line, 322 
estimation of, when cost of each measurement 
depends upon its magnitude, 322-328 
β_1: Pearson’s measure of skewness, 197, 198 
β_2: Pearson’s measure of kurtosis, 197, 198 
Beta function, incomplete-, 392, 393 

percentage points of, reference to table of, 338, 
394 

use to determine confidence limits for popula¬ 
tion proportion, 333, 334 
Bias, of bombing errors, 137, 147, 149 

of mean deviation as estimator of σ, 109n. 
of p as estimator of P, 397, 410 
of φ(p) and φ_B(p) as estimators of φ(P), 403, 404 
tables showing, 400-402 
of s as estimator of σ, 58, 162 

adjustment factor, formula for, 58 
tables of, references to, 58n. 
of s² as estimator of σ², 192, 199, 412 
when data are grouped or rounded, 199 
of sample mean as estimator of population 
mean, 58, 162, 197, 198 
when data are grouped or rounded, 197, 198, 
212, 213 

of sample median as estimator of normal 
population mean, 162 

of sample proportion as estimator of population 
proportion, 397, 410 

when inverse-sine transformation is used, 
403, 404 

tables showing, 400-402 
of statistical tests, if Sheppard's corrections are 
used, 199, 200, 203 
from time trends in data, 371, 372 
from use of inverse-sine transformation, 403, 
404 

tables showing, 400-402 
from use of Sheppard’s corrections, 199, 200, 203 
of x̄ as estimator of μ, 58, 162, 197, 198 

when data are grouped or rounded, 197, 198, 
212, 213 

of x̄ + ks as estimator of μ + kσ, 58 
Bigelow, Julian H., ix 
Binomial distribution, 250, 251, 421, 427 

ASN curve for sequential one-sided test of 
parameter of, figures showing, 240, 242 
formula for, 427, 428 
confidence limits for parameter of, 332-336 
belts giving eighty percent limits, 335 
ninety percent limits, 334 
ninety-five percent limits, 333 
ninety-nine percent limits, 332 
accuracy of, 335n. 
preparation of, 335n., 336n. 
sources of, 335n. 

determination of, from percentage points of 
incomplete beta function, 333, 334 
examples of, 334, 337 
by using inverse-sine transformation, 336 
by using non-central t-distribution, 76,
77, 336n. 


Binomial distribution, confidence limits for 
parameter of, formula for, for case of zero 
occurrences, 335, 336

relation of, to percentage points of incom¬ 
plete beta function, 333, 334 
design of experiments, for comparing param¬ 
eters of two, 258-263, 424, 427, 428 
for estimating parameter of a single, 238,
333-337 

for testing parameter of a single, 259, 260n.,
261-264 

double dichotomies, analysis of, by means of, 
249-253, 255 
examples of, 251 

experiments for comparing parameters of two, 
analysis of data from, 249-258, 424, 425 
approximate single-sample method, based 
on binomial test, 249, 250, 255 
based on χ_c, 253-255, 256, 257
based on inverse-sine transformation, 
257, 258, 263, 264 

based on normal deviates, 253-257, 263
“exact” single-sample method for, 250-253
by Fulcher-Zubin Item Analyzer, 257, 258
incorrect method for, 256 
by sequential analysis, 424, 425 
by Zubin-Fulcher Item Analyzer, 257, 258
design of, 258-265, 424

inverse-sine transformation of data from, 
257, 258, 263, 264, 397-408
number of observations needed, with sequen¬ 
tial sampling, 424, 427, 428 
with single samples, 258-261, 264 
formula for, 258, 259, 264 
nomogram for, 260 
operating characteristics of (OC), 261 
OC curve of, determination by formula, 262 
from nomogram, 263 
figure showing, 262 
OC surface of, 261, 262 
planning of, 258-265, 424
sequential, 424, 425 

experiments for estimating parameter of a 
single, analysis of data from, 332-336 
design of, 238, 333-337 

inverse-sine transformation applied to data 
from, 336 

number of observations needed, 333-337 
sequential, 238n. 

experiments for testing parameters of a single, 
analysis of data from, 260n., 263, 264, 421, 
422 

ASN curves for double, multiple, and 
sequential sampling, 240, 242 
design of, 259, 260n., 261-264, 427, 428
inverse-sine transformation of data from, 
260n., 261-264, 397-408 
number of observations needed, with double 
sampling, 237-244 

figures showing, 240-243 
with multiple sampling, 237-244 
figures showing, 240-243 
with sequential sampling, 26, 237-244, 
419, 420, 427, 428 





Binomial distribution, experiments for testing 
parameters of a single, number of 
observations needed, with sequential 
sampling, figures showing, 240-243 
formula for, 427, 428 

with single sampling, 14, 15, 26-28, 259,
260n., 263, 264 
examples of (Table 1.3), 26 
formula for, 259, 260n., 263, 264
nomogram for, use of, 260n.

OC curve of, approximate determination 
from formula, 260n., 262 
from nomogram, 260n., 263 
exact determination of points on, using 
confidence-belt charts, 336n.
using percentage points of incomplete 
beta function, 334, 336n. 
planning of, 259, 260n., 261-264, 427, 428
sequential, 421, 422, 427, 428 
one-decision test, 358 
formula for, examples, 251 
homogeneity of samples from two, testing 
249-258, 424, 425 

(See also Binomial distribution, experiments 
for comparing parameters of two) 
inverse-sine transformation of data from, 
257, 258, 259-264, 336, 397-408 
mean of, 397, 410 

moments of, third and fourth about mean, 410 
nomogram, for determining sample sizes, in 
testing parameters of, 260 
normal approximation to, 263, 264 
number of observations needed, for comparing 
parameters of two, 258-261, 264, 424, 
427, 428 

formula for, 258, 259, 264 
nomogram for, 260 

for estimating the parameter of a single, 
333-337 

for testing the parameter of a single, with 
double sampling, 237-244 
figures showing, 240-243 
with multiple sampling, 237-244 
figures showing, 240-243 
with sequential sampling, 26, 237-244, 
419, 420, 427, 428 
figures showing, 240-243 
formula for, 427, 428 

with single sampling, 14, 15, 26-28, 259, 
260n., 263, 264
examples of, 26 

formula for, 259, 260n., 263, 264 
nomogram for, use of, 260n. 

OC curves, of tests of parameter of, 262 
one-decision test of parameter of, 358 
planning experiments for comparing parameters 
of two, 258-263, 424, 427, 428 
for estimating parameter of a single, 238, 
333-337 

for testing parameter of a single, 259, 260n., 
261-264 

Poisson exponential approximation to, 426, 428 
sequential estimation of parameter of, reference 
to work on, 238n. 


Binomial distribution, sequential test, for com¬ 
paring two, 424, 425 

for testing parameter of a single, 421, 422, 
427, 428 

one-decision test, 358 

single sampling for comparison of two, 249-264 
for estimation of parameter of, 332-336 
for testing parameter of, 14, 15, 26-28, 259,
260n., 263, 264

standard deviation of, 255, 256, 397 
variance of, 397, 410 
Binomial-probability paper, 414n. 

Biological assay, 341, 342, 352
(See also Sensitivity tests) 

Biological experimentation, 311 
Bishop, D. J., 386n., 391
Bivariate distribution, normal, 146 

relation of Hotelling’s generalized T to ex¬ 
ponent of, 114, 115 

Bliss, Chester I., 342n., 343n., 350n., 352, 398,
400, 405, 415 

Bombsights, air testing of, function of, 117 

multivariate quality control applied to, 111- 
184 

problems of, 116

Bowker, Albert H., ix, xi, 96, 101n., 109 
Breaking strength, 365 

(See also Metallurgical experiments) 

Breen, Nancy Brixey, x
Broad classes (see Grouping)

Brown, Richard H., 343n., 352 
Brown University, ix 

Bureau of the Census, ix, xi, xii, 226, 354 
Bureau of Ordnance, Navy, 352 
Bureau of Standards (see Mathematical Tables 
Project; National Bureau of Standards)

C 

ci: ratio of expected value of root mean square of 
deviations from sample mean, to σ, in normal
samples, 58 

reference to tables of, 58n.
relation to o, 58n. 

Calculating machines, comparison of makes and 
models, 34n. 

use of, to simplify computation of s, 33, 34 
Camp-Meidell inequality, 49n., 221
Carver, H. C., 194n., 222
Causes, assignable, 269 

that introduce components of random varia¬ 
tion, 297, 310, 311 

that introduce components of systematic 
variation, 310, 311 
chance, 297, 310, 311 
Cayley, Arthur, 191, 222 
Central tendency, measures of, 9 
Chance causes, 297, 310, 311 
Characteristics, operating, of sampling-inspec¬ 
tion plans and statistical tests (see Operat¬ 
ing characteristics; Operating-characteristic 
curves; Operating-characteristic surface) 
Charts, control (see Control charts) 
list of, xiv 
Check list, of possible causes of lack of control, in
air testing of bombsights, 140
of steps for calculating δ(f, t₀, ε), 74, 75
for calculating t(f, δ, ε), 75, 76
for computing OC curve, for any curtailed 
(truncated) sequential inspection (test) 
procedure, 361 

for single-sampling variables inspection 
plan for fraction defective, 32, 33 
for determining k, given N, p₁, and 1 − α, 70
given N, p₂, and β, 76

for determining N and k, given p₁, p₂, α,
and β, 81, 82

for determining p, given N, u₀, and t, 76, 77
for determining u₀, given N, p₁, and ε, 76
(See also Computation) 
of general likelihood-ratio sequential test, 
420, 421 

for selection of bombsights for test, and 
arrangement of bombings, 116-122
of sequential procedures for locating a maxi¬ 
mum, 367, 368 

χ_c: chi (χ) corrected for continuity, 254

calculation of, from observed proportions (per¬ 
centages), 256, 257 
from 2X2 table, 254, 255 
significance levels of, 255 
χ² (chi-squared): a variable having the χ²-distribution,
73, 91, 270
addition theorem for, 153 
adjusted value of, 254 

calculation of, from proportions (percentages), 
256, 257 

from 2X2 table, 254, 255 
correction for continuity, 254 
significance levels of, 255 
small expected numbers, 253 
correction for continuity, general procedure for, 
254n. 

distribution of, formula for, 270 

percentage points of, 108-110, 315n. 

Cornish-Fisher type expansion for, refer¬ 
ence to, 109 
relation of K a to, 258n. 
relation to partial sum of Poisson series, 109 
tables of, 315n.
relation of F-distribution to, 316 
of normal distribution to, 258n. 
of s²-distribution to, 270, 315
of T-distribution to, 155 
formulas for, special, 254, 256, 257 
significance levels of, 108-110, 315n. 

Cornish-Fisher type expansion for, reference
to, 109 

relation to partial sum of Poisson series, 109 
tables of, 315n. 

test of homogeneity, of proportions (percentages),
256, 257
of 2 X 2 table, 254, 255
of variance estimates, 387, 388
(See also Bartlett’s test) 
use in planning and interpreting experiments 
for testing standard deviation (or variance)
estimates, 269-280


Class interval (see Grouping interval) 

Clopper, C. J., 335n., 338 

Cochran, William G., 254n., 265, 387-389, 392, 
393, 399, 407, 408, 415 

Cochran’s test, for significance of largest variance 
estimate, 383-394 

contrasted with Bartlett’s test for homogeneity 
of variance estimates, 386, 387 
example, worked, 388, 389 
significance levels of, tables of, 390, 391 
accuracy of, 388, 393 

computation of, method employed, 389-393 
interpolation in, method of, 388 

Coefficient, correlation, 413 

regression, estimation of, when cost of each 
measurement depends upon its magnitude, 
322-328 

of line fitted to probit data, 343 
estimation of, 343, 344 
relation to μ and σ, 343
of s, for setting tolerance limits (see K) 

in variables inspection for fraction defective 
(see k) 

Columbia University, iii, ix-xi 

Committee on War Research of, x 
Division of War Research of, vii 
Statistical Research Group of, iii, vii-x, 6, 8, 20, 
26, 34, 42, 52, 83, 85, 93, 101, 108, 109, 143, 
184, 188, 231, 234, 237, 244, 257, 260, 335, 
358, 361, 362, 400, 405, 419, 420, 428
Trustees of, vii 

Combination of observations, 42, 43, 113, 114, 167 

Combinations of grouping-interval widths and 
sample sizes satisfactory for statistical tests, 
213, 214 

Combining estimates of μ, 42, 43
of σ, 42, 43

probabilities from statistical tests, 167, 184 
tests for, 50-52, 113-116 

effect of, on probability of acceptance, 50, 51 
into a single “score,” 51, 113, 114 

based on generalized Student’s T, 114-116 
values of s or s², 42, 43
values of x̄, 42, 43

Comparison, of two binomial distributions, 
design of and analysis of data from experi¬ 
ments for, 247-265, 424, 425, 427, 428 
of two means, design of and analysis of data 
from experiments for, 61, 62 
of two percentages (or proportions), design of 
and analysis of data from experiments for, 
247-265, 424, 425, 427, 428 
of two standard deviations (or variances), 
design of and analysis of data from experi¬ 
ments for, 267-318 

Components of variance, 269, 297, 310-314 

Compounding probabilities from statistical tests, 
167 

Computation, of acceptance line, 232, 233, 356, 
357, 423, 425 

acceptance number, 260n., 263, 264, 422, 425 
acceptance region, boundaries of, 36-41, 52-56,

57, 67-71, 232, 233, 422, 423, 425 
adjusted value of chi or chi-squared, 254, 255 
Computation, adjustment constant, optimum, 41,
54, 70, 71 

analysis-of-variance table, 312 
angular difference between two binomial 
probabilities, 264 

between two observed proportions (or percentages),
262

arithmetic mean, 12, 100, 192 
ASN curves for, 233, 244, 362n., 427, 428 
Bartlett’s adjustment, of inverse-sine trans¬ 
formation, 399 

Bartlett’s test, of homogeneity of variance esti¬ 
mates, 387, 388 
bias of s as estimator of σ, 58

of s² as estimator of σ², when data are
grouped, 199

of x̄ + ks as estimator of μ + kσ, 58
binomial distribution, 251 

χ_c, from observed proportions or percentages,
256, 257 

from 2X2 table, 254, 255 
components of variance, 310-314 
confidence limits, for μ, when σ is known, 197, 198
when σ is unknown, 99, 200-203, 213-215
for parameter of binomial distribution, 
333-336 

for population mean, 99, 197, 198, 200-203, 
213-215 

for population proportion, 42, 43, 333-336 
for σ or σ², 198-200, 200-203, 213-215
control limits, for means, 147-149
for T-statistics, 152, 153
for variances, 151
δ(f, t₀, ε), 74, 75

dosage-mortality curve, 342-344, 350, 352 
efficiency, of estimators of population param¬ 
eters, 160-163, 227n. 

of inspection plans and statistical tests, 
26-28, 85

(See also ASN curve) 

expected frequencies, in 2 X 2 table, 253, 254 
h₁ and h₂, 232, 233, 356, 357, 422, 423, 425
K, approximate, 101, 108 
exact, 108, 109 
k, approximate, 17-19 
exact, 76-79, 81, 82 

L P , 28-33, 50-52, 76, 77, 80, 358, 361, 427 
L*, 282-296, 316, 317 
L p , 231, 271-279, 315 
Lc, 232, 233 
Lt, 262, 263 
N , approximate, 17 
exact, 81, 82 

No , 259-260, 260n., 263, 264 
Nm, 28n., 62 

N„ 27n., 28n., 231, 274, 275, 278 
Nt, 17-21, 26-28, 60-66, 81, 82 
number of observations needed, for analysis-of- 
variance tests, 311-314 
for attributes plans, with sequential samp¬ 
ling, 427, 428 

with single sampling, 259, 260n., 263, 264 
when severity of any one test cannot be 
controlled precisely, 331-338 


Computation, number of observations needed, 
for binomial test, one-sided, with sequen¬ 
tial sampling, 427, 428 
with single sampling, 259, 260n., 263, 264 
for χ²-tests of variance estimates, 231, 274,
275, 278 

for curtailed (truncated) sequential sampling, 
362n. 

when decisions are based on signs of differences
between sample means, 380
for determining whether new process (prod¬ 
uct) is more (less) variable than stand¬ 
ard, 231, 274, 275, 278 
whether one process (product) is more 
variable than another, 280, 282, 283, 296 
for difference of means test, when σ is known,
with single sample, 61, 62
when σ is unknown, 17, 27n., 28n., 60-66,
81, 82 

for difference of proportions test, with 
sequential sampling, 424, 427, 428 
single sample, 259, 260, 263, 264
for double-dichotomy test, 259, 260, 263, 
264, 424, 427, 428 

for F-tests of variance estimates, 280, 282,
283, 311-314 

for fraction-defective test based on variables, 
single sample, 17 

when μ is known, and σ unknown, 27n.,
28n., 231, 274, 275, 278n.
when μ is unknown, and σ known, 27n.,
28n., 62

when μ and σ are both unknown, 17-21,
27n., 28n., 60-66, 81, 82 
for general one-parameter tests, sequential, 

427 

single sample, 61, 62 

for mean test, single sample, σ known, 27n.,
28n., 62

σ unknown, 17-21, 27n., 28n., 60-66, 81, 82
for one-decision sequential binomial test, 
358, 427, 428 

for proportion test, sequential sampling, 427, 

428 

single sample, 259, 260n., 263, 264 
when range in samples of k determines lot 
quality, sequential sampling, 233 
single sample, 231, 274, 275, 278n. 
for setting tolerance limits, references to,
109, 110 

for standard deviation tests, 231, 274, 275, 
278, 280, 282, 283, 296 
for variables inspection plans for fraction 
defective, 17-21, 27n., 28n., 64-66, 81, 82 
OC curve, for analysis-of-variance tests, 310- 
314 

for attributes plans, sequential sampling, 
426, 427 

single sampling, 260n., 262, 263 
for binomial test, one-decision sequential, 358 
one-sided sequential, 426, 427 
truncated, 361 

one-sided single sample, 260n., 262, 263 
Computation, OC curve, for χ²-tests of variance
estimates, 231, 232, 271-279, 315
for comparison of two means, σ known, 61, 62
σ unknown, 30, 59-63, 71-73, 76, 77, 80
for comparison of two proportions or percentages,
sequential, 424, 426, 427
single sample, 262, 263 
for comparison of two standard deviations 
or variances, single sampling, 282-296, 
316, 317 

when decisions are based on signs of differ¬ 
ences between sample means, 378, 379 
for double-dichotomy test, 262, 263 
for F-tests of variance estimates, 282-296, 
316, 317 

for fraction-defective test, based on variables, 
28-33, 76, 77, 80 

for general one-parameter tests, sequential, 
426, 427 

single sample, 61, 62

for mean test, σ known, 61, 62
for one-decision sequential binomial test, 358
for proportion test, sequential sampling,
426, 427

single sample, 260n., 262, 263
when range in samples of k determines lot 
quality, sequential, 233 
single sample, 231, 271-279, 315 
for standard deviation tests, 231-233, 271- 
279, 282-296, 315-317

for variables inspection plans for fraction 
defective, 28-33, 76, 77, 80 
φ and φ_B, 399, 401
φ(α, β, n₁, n₂), 282-296, 316, 317
Up, for one-decision sequential binomial test, 
358 

ρ(α, β, n), 231, 271-279, 315
s or s², 12, 33, 34, 162, 192
significance levels, of Cochran’s test, 389-393 
of T-distribution, 155 
of T-statistics, approximate, 155
of t-distribution, approximate, 148
T (angular difference), 262 
T (Hotelling’s), 142, 154 
T-statistics, 131, 133, 135, 145
t(f, δ, ε), 75, 76
tolerance limits, 97-109 

two-point estimate of μ and σ from probit
data, 350 
uo, 76 

x̄, 12, 100, 192

yardsticks, for differences of means, 150 
for ratio of two variance estimates, 152 
for T-statistics, 156 
Computer’s rule (see Rounding rule) 

Comrie, L. J., 93 
Conditional probability, 41, 251 
Confidence in conclusions, 98, 269 
Confidence belts, accuracy of, 335n. 
for population proportion, 332-336 
eighty percent, 335 
ninety percent, 334 
ninety-five percent, 333
ninety-nine percent, 332 


Confidence belts, preparation of, 335n., 336n. 
sources of, 335n. 

Confidence coefficient ( 7 ,r), 43, 98, 100, 108, 332, 
336 

definition of, 98 
interpretation of, 43 

Confidence intervals, 42, 43, 99, 100, 332-336 
contrasted with statistical tolerance ranges, 
99, 100 

definition of, 42 

dependence of width of, on confidence coeffi¬ 
cient, 43 

on sample size, 99, 100 

for parameter of binomial distribution, 332-336 
for population proportion, based on attributes 
inspection, 332-336
based on variables inspection, 42-45
for process-average percent defective, based on
attributes inspection, 332-336 
based on variables inspection, 42, 43 

(See also Confidence belts; Confidence 
limits) 

Confidence limits, 99, 100, 196-203, 332-336 
contrasted with statistical tolerance limits, 99, 
100 

effect of grouping (rounding) on, 192, 196-203, 
213-215 

for μ, when σ is known, 197, 198

based on grouped (rounded) data, 197, 198
when σ is unknown, 99

based on grouped (rounded) data, 200-203, 
213-215 

for parameter of binomial distribution, 332-336
belts giving, 332-335 (see Confidence belts) 
determination of, accurate, from percentage 
points of incomplete beta function, 
333, 334 

examples of, 334, 337 

approximate, by means of inverse-sine 
transformation, 336 

by means of non-central t-distribution,
76, 77, 336n. 

formula for, accurate, for case of zero occur¬ 
rences, 335 
approximate, 336 

relation of, to percentage points of incom¬ 
plete beta function, 333, 334 
for population mean, 99, 197, 198 
for population proportion, 332-336
for population standard deviation (or variance), 
based on rounded data, 198-200, 213-215 
(See also Confidence belts; Confidence inter¬ 
vals) 

Consumer’s risk (β), conventional value of (10%),
46 

definition of, 16, 228, 229 

effect of reduced or tightened inspection on, 45 
maximum of, with variables inspection plans 
for fraction defective, 16, 17 
selection of, factors to consider in, 45-49 
Consumption requirements, analysis of, 47-49 
Contingency tables, 2 X 2, 249-265, 424, 425 
(see Binomial distribution, experiments for 
comparing parameters of two) 
Contracts, vii

Control charts, applied to air testing of bomb- 
sights, 122-142 

some novel features of, 119-147 
trouble shooting with aid of, 137-140 
Control limits, for means, 147-150 

allowance for sampling variation of data in 
base period, 119, 147, 148 
for T-statistics, 152-155 
for variances, 151, 152 
(See also Quality control) 

Cornish-Fisher expansion, 109
Correction, Bartlett's, 399 

heuristic derivation of, 413, 414 
table to facilitate, 406 

of chi or chi-squared, for continuity, 254, 255, 
257 

general procedure for, 254n. 
factors: a, b, and a, 58 
for grouping, Sheppard's, 193-196, 222-223 
not to be used in connection with confi¬ 
dence limits or tests of significance, 199, 
200, 203 

of inverse-sine transformation, 399, 405-407, 
413, 414 
Yates’, 254, 255 

(See also Adjustment) 

Correlation coefficient, mean of, approximate, 413 
transformation of, to stabilize variance, 413
variance of, approximate, 413 
z -transformation of, 413 
Cost, of experiment, 269 

of measurements dependent upon their magni¬ 
tudes, 321, 322 

linear regression analysis when, 322-328 
Craig, Cecil C., 93, 194n., 195n., 196n., 222 
Critical level of a normally distributed test func¬ 
tion, derivation of, 61, 62 
Crum, W. L., 145n., 183 

Cumulative distribution, estimating μ and σ from
data on, 339-352 

optimum design of experiments for, 345, 346, 
349-352 

Curtailment, of sequential test, 361 

of single-sample inspection plan for fraction 
defective, based on attributes, 14, 238 
based on variables, 14 
Curtiss, John H., 407, 410, 415 
Curve fitting, when cost of each measurement 
depends upon its magnitude, 321-328 
when data are observations on cumulative 
distribution, 342-352 
when data are probits, 342-352 
to data from sensitivity tests, 342-352 

D 

d: distance from target, 342 
Daniels, H. E., 297n., 318 

Data, adjusted, in variables inspection for fraction 
defective, 36-41, 54, 69-71 
amount of (see Amount of inspection; N; N;
Sample size)


Data, on attributes, 7 
bias of, 160, 371 

combination of, 42, 43, 113-116, 167 

correlated, 50-52, 113-116 

on cumulative distribution, 339-352 

grouped, 187-215

probit, 339-352 

representative, 160 

rounded, 187-223 

selection of, reasons for random step in, 158- 
160 

tabular, 215-221 
time trends in, 371 

transformation of, to stabilize variance, 408-
413 

on variables, 7

Davis, O. L., 389n., 393 
Decimals, significant, 189n., 190n.

Decision function, 381, 382 
Decisions, based on inspection by attributes, 
7, 237 

by variables, 7 

based on sign of difference between sample 
means, 377 

conditions rendering optimal, 381, 382 
probability of being wrong, 378-380 
figure for, 379 
table for, 378 

regarding largest of two or more variance esti¬ 
mates, 386, 387, 388, 389 
regarding level of performance of process or 
product, 61, 62, 377, 423 
regarding variability of processes (products), 
231, 233, 269-271, 278, 279, 280-283
wrong, 271, 281, 282, 313 
Deckinger, R. L., 335n. 

Defective, 7 

Degrees of freedom (f,n,v), in analysis of variance, 
312 

of χ², 153, 231, 232, 270
division by, 12, 13, 43, 312 
of F, 151, 281, 296, 312 

of numerator and denominator of F to be chosen 
equal, 296 

partitioning of, 312, 386 

of s or s², 12, 13, 43, 199, 203, 270, 312, 387, 388
of T, 155 

of T-statistics, 145, 152-154 
of t, 148, 203 
non-central, 73 

δ: a constant in numerator of non-central t, 73
Deming, W. Edwards, 93, 110
Department, of Agriculture, 352 
of Commerce (see Bureau of the Census; Na¬ 
tional Bureau of Standards) 

Navy (see Bureau of Ordnance, Navy)

War (see Army)

Departures from normality, seriousness of, 49, 50, 
99, 346, 377, 381 

Derivation, of acceptance criteria, 59-63, 66-71, 
263, 264, 270, 280 
of angular transformation, 410, 411 
of Bartlett's adjustment, 413, 414 





Derivation, of criteria for “immediate” accept¬ 
ance, in variables inspection for fraction 
defective, 66-71 

of “exact” test for independence in 2 X 2 table, 
260-263 

of formulas for, A (adjustment constant), 70, 71 
A (critical level), 61, 62 
adjusted value of chi or chi-squared, 254n. 
average number of observations, references 
to, 244 

Bartlett's adjustment, of inverse-sine trans¬ 
formation, 413, 414 
χ_c, 264n.

critical level, 61, 62 
K, references to, 101, 108 
k, 59-63

L P , 30, 59, 61, 62 
Lt, 261, 262 

number of observations needed, 60-63, 262- 
264, 380 

OC curves, 30, 59, 61, 62, 261, 262 
φ_B, 413, 414

φ(α, β, n₁, n₂), 281n., 316, 317
probability that sample means are in opposite
order to population means, 378, 379
ρ(α, β, n), 315

tolerance-limit factor, references to, 101, 108 
two-point estimates of μ and σ from probit
data, 350 

variances of, 351, 352 

variability of amount of inspection, references 
to, 244 

of general formulas for critical level and sample
size, 61, 62

of general transformation to stabilize variance,
408-410 

of inverse-sine transformation, 410, 411 
of logarithmic transformation of s or s², 412, 413
of optimum arrangement for estimating μ and σ
from probit data, 349-352 
of optimum choice of adjustment constant, 70, 
71 

of rejection criteria, 59-63, 66-71, 263, 264, 270, 
280 

of square-root transformation, of a Poisson vari¬ 
able, 411 

of standard procedure for testing whether new 
process (product) is more variable than 
standard, 270, 271 

whether one process (product) is more vari¬ 
able than another, 280, 281 
of tolerance-limit factors, references to, 101, 108 
of transformation to stabilize variance, 408-410 
of variances of two-point estimates of μ and σ
from probit data, 350-352 
of z-transformation of correlation coefficient, 
413 

Design of experiments, 246-372 
abandoning prior to completion, with provision 
for, 353-362 

for air-testing bombsights, 116-122, 140-147 
analysis of variance, leading to, 297, 310-314 
for comparing two means, 61, 62, 380, 381 


Design of experiments, two proportions or percentages,
258-265, 424, 425
two standard deviations or variances, 267- 
318 

when cost of any one measurement depends 
upon its magnitude, 319-328 
when data are observations on cumulative dis¬ 
tribution, 339-352 
when data are probits, 339-352 
when decisions are based on signs of differences 
between sample means, 377-382 
for determining existence of components of ran¬ 
dom variation, 297, 310-314 
for determining maximum of a function, 363- 
372 

for determining whether level of performance 
of new process (product) exceeds standard, 
17, 27n., 28n., 60-66, 81, 82, 423
whether level of performance of one process 
(product) exceeds that of another, 61, 62 
whether variability of new process (product) 
exceeds standard, 269-278 
is less than standard, 278-280 
whether variability of one process (product)
exceeds variability of another, 280-296 
for dosage-mortality studies, 339-352 
for estimating components of variance, 297, 
310-314 

factorial, 356, 366, 367, 369-372 
failure of equipment, with provision for rate of, 
355 

when independent variable is known only in 
terms of a probability distribution, 331-338 
for maximum of a function, 363-372 
for probit data studies, 339-352 
for proofing gun barrels, 331-338 
randomization in, 141, 371 

rate of failure of equipment, provision for, in, 
355 

replication in, 141, 159 
for sensitivity tests, 339-352 
sequential, 417-429 

for determining maximum of a function, 367- 
372 

when rate of failure of equipment is high, 
353-362 

when severity of any one test cannot be con¬ 
trolled precisely, 331-338 
Designs (see Experiment designs) 

Destructive tests, 9, 321, 331, 341 
Deviate (see Normal deviate) 

Deviation (see Mean deviation; Standard devia¬ 
tion) 

Differences between means, acceptance criterion 
for non-significance of, σ known, 61, 62
σ unknown, 71, 72, 150

analysis-of-variance test of significance of, 385, 
386 

design of experiments when decisions are to be 
based on, 61, 62, 377-382 
effect of grouping or rounding on test of, σ
known, 196-198
σ unknown, 200-203, 214, 215
Differences between means, F-test of, 385, 386
number of observations needed for test of, σ
known, 61, 62

σ unknown, 71-73, 81, 82, 150
one-sided test, σ known, sequential, 423
single sample, 61, 62
σ unknown, sequential, 71, 83-85, 426
single sample, 71, 72
sequential test of, σ known, 423
σ unknown, 71, 72, 83-85, 426
signs of, basis for decisions, 377-382
t-test of, sequential, 71, 72, 83-85, 426
single sample, 71, 72, 81, 82, 150
two-sided test, σ known, 61, 62, 426
σ unknown, 150, 426
yardsticks for, 150 

Differences between proportions or percentages, 
acceptance criteria for non-significance of, 
251-255, 263, 264, 424, 425
angular representation of, 262, 264
design of experiments for testing, 258-265, 424,
425

number of observations needed for test of, 262-
264 

sequential test for, 424, 425 
single-sample test, approximate methods for, 
249, 250, 253-257, 263 
“exact” method, 250-253 
Fulcher-Zubin Item Analyzer, 257, 258 
incorrect method, 256 

(See also Binomial distribution, experiments
for comparing parameters of two)

Digits, significant, 189n., 190n. 

Discontinuities of sampling distributions, result¬ 
ing from grouping data, 203-218 
Discriminant function, best, for discriminating be¬ 
tween two populations of known form, 381, 
382 

Discriminating power of experiment, 269 
Dispersion, measures of (see Mean deviation; 

Range; Standard deviation) 

Distribution, of adjusted value of chi (xc), 255 
of amount of inspection, with double, multiple, 
and sequential sampling, 235-244 
computational methods for determining, 244 
figures showing, 241, 243 
nature of, in repeated sequential sampling, 
428 

of angular difference between two proportions 
or percentages, 263, 264 
of arithmetic mean, 197, 198 

when data are grouped, 196-198, 203, 212, 
213 

of average error, 162, 163 
of Bartlett's test for homogeneity of variance 
estimates, 387, 388 

beta function, incomplete, 333, 334, 338, 392- 
394

binomial, 250, 251, 421, 427 
bivariate normal, 146 
of chi, corrected for continuity (xe), 255 
of chi-squared (χ²), formula for, 270

percentage points of, 108-110, 258n., 315n.

relation to F-distribution, 316


Distribution, of chi-squared (χ²), relation to
normal distribution, 258n.
to s²-distribution, 270, 315
to T-distribution, 155

of Cochran’s test for significance of largest vari¬ 
ance estimate, 389, 392, 393 
of correlation coefficient, 413 
cumulative, 339-352 
Error (see Normal distribution) 
of error of an interpolated value, 221 
of F, formula for, 281 

relation to chi-squared distribution, 316 
to incomplete beta function, 392, 393 
significance levels of, 394 

detailed references to, 317n. 

Paulson's approximation, 393, 394
when data are rounded, 214, 215
Gaussian (see Normal distribution)
generalized normal, 197

(See also Gram-Charlier series)
of generalized Student ratio (T), 155
of Hotelling's T, 155
J-shaped, 194, 222
kurtosis of a, measures of, 197, 198
of mean (see Distribution of arithmetic mean)
of mean deviation, 162, 163 
of median, 162 

moments of, about origin, 193 
about true mean, 195 
centroidal, 195 

effect of grouping or rounding on, 193-196 
ideal, 193 
realistic, 193 

of N, in double, multiple, and sequential sam¬ 
pling, 235-244 

computational methods for determining, 244 
figures showing, 241, 243 
nature of, in repeated sequential sampling, 
428 

non-central t, 8n., 22, 64, 71-82 

use in variables inspection for fraction defec¬ 
tive, 71-82 
non-normal, 99, 381 
normal, figure showing, 10 
formula for, 101 
(See also Normal distribution) 

Poisson, 411, 412, 426

relation to χ² distribution, 109
of proportions or percentages, 250, 251, 263, 
264, 397 

(See also Binomial distribution) 
of r, 413 

of range, 229, 230, 234 
rectangular, 195, 214, 215 
of s or s², 58, 199

when data are grouped or rounded, 198-200, 
203, 204 

relation to χ² distribution, 270, 315
semi-invariants of, 196n.
skewness of, measures of, 197, 198 
of Student’s t, 201, 202 

when data are grouped or rounded, 200, 
208, 214, 215 
Distribution, of Student’s z, 187, 188, 200 
of T, 154, 155 

relation to χ² distribution, 155 
relation to F-distribution, 155 
of T-statistics, 152-155 
of t, 72, 148 

effect of grouping or rounding on, 200-203, 
214, 215 

non-central, 8n., 22, 64, 71-82 
percentage points of, 148, 148n. 
of variance ratio (F), formula for, 281 
relation to χ²-distribution, 316 

to incomplete beta function, 392, 393 
significance levels of, 394 
detailed references to, 317n. 

Paulson’s approximation, 393, 394 
when data are grouped or rounded, 214-215 
of x̄, 197, 198 

when data are grouped or rounded, 196, 198, 
203, 212, 213 

Division of War Research, Columbia University, 
vii 

Dodge, Harold F., 8n., 93 
Dosage-mortality curve, 342, 343n., 352 

design of experiments for determining, 339-352 
Double dichotomy, analysis of data in form of, 
sequential, 424, 425 
single sample, 250-258 

approximate method (χc), 253-255 
“exact” method, 250-253 
design of experiments leading to, 258-265 
number of trials needed, 258-261 
formula for, 259, 260 
nomogram for, 260 

operating characteristics (OC) of single-sample 
test, 261 

OC curve, figure for, 263 
formula for, 262 
OC surface, 261, 262 
sequential analysis applied to, 424, 425 
Double sampling, attributes inspection for frac¬ 
tion defective, 237-244 

average amount of inspection, compared to 
multiple and sequential sampling, 238-249 
computational methods for, references to, 244 
figures showing, 240, 242 
characterized by relation of rejection number 
( Rk ) to acceptance number ( Ak ), 237 
variability of amount of inspection [σ(N)], compared 
to multiple and sequential sampling, 
238-244 

computational methods, references to, 244 
figures showing, 241, 243 
Dwyer, Paul S., 172n., 183 

E 

Economic time series, 177 

Efficiency, of sample statistics as estimators of 
population parameters, 227n. 
arithmetic mean as estimator of μ, 162 
when data are grouped or rounded, 197 
mean deviation as estimator of σ, 163 


Efficiency, of sample statistics as estimators of 
population parameters, median as estimator 
of μ, in normal samples, 162 
range as estimator of σ, 227 
s as estimator of σ, 162, 227 
of sampling-inspection plans and procedures, 
attributes versus variables, 26-28 
double sampling, versus multiple and sequen¬ 
tial, 237-244 

multiple sampling, versus double and sequen¬ 
tial, 237-244 

sequential sampling, versus non-sequential, 
26, 85, 237-244, 419, 420 
single sampling, versus sequential, 26, 85, 
419, 420 

of statistical tools, 386n., 394 
Eisenhart, Churchill, iii, ix-xii, 186, 268, 376, 384, 
396 

Elderton, W. Palin, 194n., 222 
Enumeration data (see Attributes; Binomial distribution; 
Poisson distribution) 
e: probability of acceptance, 59 
ε: probability that a normal deviate will exceed 
Kε, 17 

Error, average, 162 

(See also Mean deviation) 
components of, 269, 297, 310-314 
of first kind, 271, 281, 282 
grouping of, 187-189, 195, 218, 222 
of interpolated value, 218, 219, 222 
of measurement, 188, 189 
probable, 187n., 223 
random, 269, 297, 310 
of second kind, 271, 281, 282 
standard (see Standard deviation) 
systematic, 222, 310 
of tabular value, 215-218, 221 
Estimate, statistical use of the term, 12 
Estimation, abandoned prior to completion, 355, 
356 

by confidence intervals, 42, 43, 99, 100, 332-336 
effect of grouping or rounding on, 192, 196-203, 
213-215 

of fraction defective, by attributes inspection, 
332-336 

by variables inspection, 42, 43, 58 
of maximum of a function, 363-372 
of μ + kσ, by x̄ + ks, 58 

of population mean (μ), from probit data (observations 
on cumulative distribution), 343, 350 

by sample mean, 12, 58, 161, 162, 192 

effect of grouping or rounding on, 187, 188, 
196-198, 200, 203, 212, 213 
by sample median, 161, 162 
from sensitivity data, 343, 350 
of population proportion (P), by confidence 
belts, 332-336 

by sample proportion (p), 397, 410 
from transformed proportion (φ), 403, 404 
of population standard deviation (σ), from probit 
data (observations on cumulative distribution), 
343, 350 
from mean deviation, 162, 163, 169 
Estimation of population standard deviation (σ), 
from s, 12, 58 

of population variance (σ²), by mean deviation, 
162, 163, 169 
by s², 162 

when data are grouped or rounded, 198- 
200, 212-214 

sequential, reference to, 238n. 
of slope and intercept of regression line, when 
cost of each measurement depends upon its 
magnitude, 319-328 
when probit data are involved, 343 
of statistical tolerance limits, 97-110 
“Exact” method, for analyzing 2 X 2 table, 250-253 

Expected value, 397, 408 

of mean deviation, in normal samples, 169 
of p, in binomial samples, 397, 410 
of φ(p) and φB(p), 403, 404 

tables giving, 400-402 
of range, in normal samples, 227n. 
of s, in normal samples, 58 
of s², 192, 412 

when data are grouped or rounded, 199, 212, 
213 

of x̄, arbitrary population, 197, 198 

when data are grouped or rounded, 197, 198 
of x̄ + ks, 58 

Experiment designs, factorial, 356, 366, 367, 369-372 

definition of, 366n. 

problems for which ideally suited, 366 
sequential, 417-429 

for determining maximum of a function, 367- 
372 

when failure rate of equipment is high, 353- 
362 

(See also Sequential analysis; Sequential- 
sampling-inspection plans) 

Experimental method, compared to standard 
method, 250 

Experimentation, when cost of each measurement 
depends upon its magnitude, 319-328 
when failure rate of equipment is high, 355, 356 
when measurements are observations on cumu¬ 
lative distribution, 339-352 
role of randomization in, 141, 142, 160 
role of replication in, 159 
when severity of any one test cannot be controlled 
precisely, 331-338 
(See also Design of experiments; Experiment 
designs; Experiments) 

Experiments, abandoning prior to completion, 
provision for, 355, 356 

for ammunition-proofing of gun barrels, 331-338 
metallurgical, 321 

(See also Design of Experiments; Experimen¬ 
tation; Experiment designs) 

Extreme observations, 234 

F 

F: the variance ratio, or any random variable 
having the variance-ratio distribution, 151, 
152, 155, 281 


F: the variance ratio, analysis of variance, uses 
in, 311-314 

definition of, customary, ambiguity of, 151n. 
general, 281 

for grouped or rounded data, 215 
degrees of freedom (n₁, n₂; ν₁, ν₂), ambiguity regarding, 
151n. 

unambiguous specification of, 281 
distribution of, formula for, 281 

percentage points, Paulson’s approximation, 
393, 394 

relation to percentage points of χ²-distribution, 316 
Fisher’s z, 317n. 

incomplete beta function, 392, 393 
tables of, references, 151n., 184, 317n., 318 
comments on, 151n., 152n., 317n. 
distribution of log F, when degrees of freedom 
are equal, 152 
e^(2z), 317n. 

formulas for, general, 281 

for grouped or rounded data, 215 
special, 151, 169 
relationship to χ², 215, 281, 316 
to Fisher’s z, 317n. 
of Hotelling’s T to, 155 
to incomplete beta function, 392, 393 
significance levels of, Paulson’s approximation, 
393, 394 

relation to significance levels of χ²-distribution, 
316 

of incomplete beta function, 392, 393 
tables of, references to, 151n., 184, 317n., 
318n. 

comments on, 151n., 152n., 317n. 
test of homogeneity of sample means, 385, 386 
compared to Bartlett’s test for homogeneity 
of variance estimates, 386 
discriminating power of, improved by parti¬ 
tioning comparisons, 386 
as “omnibus” test, 386 

test of ratio of two variance estimates, 280-318 
use in planning and interpreting experiments 
for comparing two standard deviation or 
variance estimates, 280, 283, 296 
for detecting presence of components of 
variance, 311-314 

for determining whether one process (product) 
is more variable than another, 280-283, 296 

F(N): any function of N that is independent of σ, 
61 

f: degrees of freedom of chi-squared (χ²), 75 
Factorial design, of experiments, 356, 366, 367, 
369-372 

definition of, 366n. 
situations for which ideal, 366 
Factorials, in “exact” analysis of 2 X 2 table, 
251-253 

logarithms of, reference to tables of, 253n. 
Failure of equipment, allowance for rate of, in 
planning experiments, 355-362 
Ferris, C. D., 318n. 

Fiducial limits (see Confidence limits) 
Figures, list of principal, xiv 
Fire-control devices, 240 

Fisher, Ronald A., 18, 93, 109, 116, 120n., 148n., 
151, 152n., 159n., 163n., 167, 177n., 183, 
192n., 197, 200, 201, 203, 211n., 212, 218, 
219n., 220, 221, 222, 232, 234, 250n., 255, 
258n., 265, 270n., 297, 315n., 317n., 318, 
325n., 328, 342n., 343n., 352, 366n., 372, 398, 
399, 400, 405, 407, 413, 414n., 415 
Fisher-Cornish type of expansion, 109 
Five-point OC curve, 28-30, 427 
Flatness of probability distribution (see Kurtosis) 
Forman, H. Irving, x 

Formula, for A (adjustment constant), 41, 54, 70, 
71 

A (critical value of likelihood ratio), 420 
A (critical value of normally distributed test 
function), 62 

A (function in formulas for sequential variables 
plan for fraction defective), 84 
Ak, in terms of Rk, 237 

a (critical value of X in sequential analysis), 84, 
420 

a (multiple of σ in formula for mean of s), 58 
acceptance criterion, in analysis of variance, 
312, 313 

for difference of means, 61, 62 
for difference of proportions, 263, 264 
general, in sequential analysis, 420 
with (large) single sample, 61, 62 
for mean, in sequential test, 423 
in single-sample test, 61, 62 
for proportion, in sequential test, 422 
in single-sample test, 260n., 264 
when range in samples of k determines lot 
quality with single sample, 229, 232 
for ratio of two variance estimates, 281, 283 
for s², in single-sample test, 232, 270, 279 
in variables inspection for fraction defective, 
12, 33-35, 37, 41, 69-71 
acceptance line, for number of “successes,” in 
binomial test, 422 

for number of “success-failure” pairs, in 
double-dichotomy test, 425 
for sum of observations, in test for μ, with σ 
known, 423 

for sum of squared deviations, in test for σ, 
with μ unknown, 232 
adjusted value of chi (χc), 254, 257 
adjustment constant (A), 41, 54, 70, 71 
adjustment of k, N fixed, 19, 20 
amount of inspection, attributes plan, sequen¬ 
tial sampling, 427, 428 
single sample, 259, 260n., 263, 264 
binomial test, one-sided, sequential sampling, 
427, 428 

single sample, 259, 260n., 263, 264 
when decisions are based on signs of differ¬ 
ences between sample means, 380 
difference of means test, σ known, 61, 62 
σ unknown, 17, 27n., 28n., 60-66, 81, 82 
difference of proportions test, 259, 260, 263, 
264 


Formula, amount of inspection, double-dichotomy 
test, 259, 260, 263, 264 
fraction defective test based on variables, 17 
when μ is known and σ unknown, 27n., 
28n., 231, 278n. 

when μ is unknown and σ known, 27n., 
28n., 62 

when μ and σ are both unknown, 17, 27n., 
28n., 60-63 

general one-parameter sequential tests, 427 
single-sample tests, 61, 62 
mean test, σ known, 27n., 28n., 62 
σ unknown, 17, 27n., 28n., 60-66, 81, 82 
proportion test, sequential sampling, 427, 428 
single sample, 259, 260n., 263-264 
when range in samples of k determines lot 
quality, sequential sampling, 233 
single sample, 231, 278n. 
standard deviation test, sequential sampling, 
233 

single sample, 231, 278n. 
variables inspection plans for fraction defec¬ 
tive, 17 

when μ is known and σ is unknown, 27n., 
28n., 231, 278n. 

when μ is unknown and σ is known, 27n., 
28n., 62 

when μ and σ are both unknown, 17, 27n., 
28n., 60-63 

analysis-of-variance table, entries of, 312 
angular difference, between two binomial proba¬ 
bilities, 262, 264 

between two observed proportions, 264 
angular transformation, 258, 398, 399 
modified, 401 

arithmetic mean, 12, 100, 101, 192 
asymptotic, for confidence limits for population 
proportion, 336 

for percentage points of T-distribution, 155 
for percentage points of t-distribution, 148 
for tolerance-limit factor K, 101 
ASN curve, for general one-parameter sequen¬ 
tial tests, 427 

for sequential one-sided binomial test, 427, 
428 

with acceptance line omitted, 358 
for sequential one-sided test of σ, 233 
b (critical value of —X in sequential analysis), 
84, 421 

b (function that appears in formula for k when 
N is fixed), 20, 59, 61 

b (multiple of σ/√(2(N − 1)) in formula for 
standard deviation of s), 58 
Bartlett’s adjustment, of inverse-sine trans¬ 
formation, 399 

Bartlett’s test of homogeneity of variance esti¬ 
mates, 387, 388 
beta function, incomplete, 392 
binomial distribution, 251 
boundaries of acceptance and rejection regions, 
sequential binomial test, 422 
sequential double-dichotomy test, 425 
sequential mean test, σ known, 423 
Formula, boundaries of acceptance and rejection 
regions, sequential procedure for determin¬ 
ing whether to complete or abandon an 
experiment, 356-361 

sequential standard deviation test, μ unknown, 
232, 233 

single-sample variables inspection plans for 
fraction defective, 35-37, 41, 53, 54, 56, 57, 
67-71 
cs, 58 
χc, 254 

χ², 254, 256, 257 
χ²-distribution, 270 

Cochran’s test for significance of largest vari¬ 
ance estimate, 388 

approximate significance levels of, 392 
confidence limits, for parameter of binomial distribution, 
335, 336 
control limits, for means, 147-148 
correlation coefficient, transformation of, 413 
critical level (A) of normally distributed test 
function, 62 

degrees of freedom, in analysis of variance, 312 
of T, 155 

of T-statistics, 145, 152-154 
δ (f, t₀, ε), 74 

for estimating variance of linear regression 
fitted by weighted least squares, 343 
expected frequencies, in 2 X 2 table, 253 
expected value, of mean deviation, in normal 
samples, 169 

of p, in binomial samples, 397, 410 
of s, in normal samples, 58 
of s², arbitrary population, 197, 198 
when data are grouped or rounded, 199 
of x̄, 197, 198 
of x̄ + ks, 58 
F, general, 281 

for grouped or rounded data, 215 
special, 151, 169 

γ₁, 198 

γ₂, 198 

general transformation to stabilize variance, 
409 

generalized Student ratio (T), 154, 155 
h₁ and h₂, 232, 233, 356, 357, 422, 423, 425 
Hotelling’s T, 154, 155 
interpolated value, error of, 219 
variance of, 219, 220 
interpolation without differences, 219 
inverse-sine transformation, 258, 398, 399 
modified, 410 
K, 101, 108 
Kα, 263 
Kβ, 263, 264 
Kε, 17, 18, 31, 260 
k, 17, 19, 59, 60, 63, 81, 82 
kurtosis, measures of, 197, 198 
L P , 30, 51 
Lv, 232, 233 
Lr, 262 

Lagrangian interpolation coefficients, 219 
m, 360 

mean (see Formula, arithmetic mean) 


Formula, moments, of arbitrary distribution, 193 
when data are grouped or rounded, 193, 195 
average, 194 

of distribution of s², grouped and ungrouped 
data, 199 

of t, grouped and ungrouped data, 199 
of x̄, grouped and ungrouped data, 198 
N (number of observations needed in each of 
two samples), 259, 264, 380 
N (number of observations needed in single 
sample), 17, 26, 28, 60-62, 72, 73, 81, 231, 
259, 260n., 264, 278n. 

N̄, 233, 427, 428 
Na, 26, 259, 260n., 264 
Nm, 28n., 61, 62 
Nσ, 28n., 231, 278n. 
Nt, 17, 28n., 60, 61, 81 
non-central t, 72, 73 

normal distribution, 17, 18, 31, 161, 260, 263, 
423 

number of observations needed, with attributes 
inspection, sequential sampling, 427, 428 
single sample, 259, 260n., 264 
binomial test, one-sided, sequential sampling, 
427, 428 

single sample, 259, 260n., 264 
χ² test of variance estimate, 231, 278n. 
when decisions are based on signs of differences 
between sample means, 380 
for determining whether level of performance 
of a process (product) exceeds standard, 
17, 28n., 60-62, 81 

whether variability of performance of a 
process (product) exceeds standard, 231, 
278n. 

difference of means test, single sample, σ 
known, 61, 62 

σ unknown, 17, 28n., 60, 61, 81 
difference of proportions test, sequential 
sampling, 424, 427, 428 
single sample, 259, 260, 263, 264 
double-dichotomy test, sequential sampling, 
424, 427, 428 

single sample, 259, 260, 263, 264 
fraction defective test based on variables, 17 
μ known, σ unknown, 28n., 231, 278n. 
μ unknown, σ known, 28n., 61, 62 
μ and σ unknown, 17, 28, 60, 61, 81 
general one-parameter tests, sequential 
sampling, 427 
single sample, 61, 62 

mean test, single sample, σ known, 28n., 61, 
62 

σ unknown, 17, 28n., 60, 61, 81 
proportion test, sequential sampling, 427, 428 
single sample, 259, 260n., 264 
when range in samples of k determines lot 
quality, sequential sampling, 233 
single sample, 231, 278n. 
for setting tolerance limits, references to, 109, 
110 

standard deviation test, sequential sampling, 
233 

single sampling, 231, 278n. 
Formula, number of observations needed, variables 
inspection plans for fraction defective, 17 

μ known, σ unknown, 28n., 231, 278n. 
μ unknown, σ known, 28n., 61, 62 
μ and σ unknown, 17, 28n., 60, 61, 81 
OC curve, for analysis-of-variance tests, 313 
attributes inspection, single sample, 260n., 262 
binomial test, one-decision, sequential, 358 
one-sided, single sample, 260n., 262 
χ² test of variance estimate, 231, 232, 271, 
278n., 279, 315 

when decisions are based on signs of differ¬ 
ences between sample means, 379 
difference of means test, σ known, 61-62 
σ unknown, 30, 59-61, 62, 63, 72, 73 
difference of proportions test, single sample, 
262 

double-dichotomy test, single sample, 262 
experiments for determining whether level of 
performance of one process or product 
exceeds standard, 30, 59-63, 72, 73 
whether variability of one process or 
product exceeds standard, sequential 
sampling, 233 

single sample, 231, 232, 271, 278n., 279, 
315 

whether variability of one process or 
product exceeds variability of another, 
282, 316 

F-tests of variance estimates, 282, 313, 316 
fraction defective test based on variables, 
30, 59-63, 72, 73 

general one-parameter tests, single sample, 
61, 62 

mean test, single sample, σ known, 61, 62 
σ unknown, 30, 59-61, 62, 63, 71-73, 76, 77 
one-decision sequential binomial test, 358 
proportion test, single sample, 260n., 262 
when range in samples of k determines lot 
quality, sequential sampling, 233 
single sample, 231, 232, 271, 278n., 279, 315 
standard deviation tests, 231, 232, 271, 278n., 
279, 282, 313, 315, 316 
variables inspection for fraction defective, 
30, 59-63, 72, 73 
φ and φB, 399, 401 
φ (α, β, n₁, n₂), 282, 316 

probability that sample means will be in oppo¬ 
site order to population means, 379 
probit, variance of a, 350 

R p , for one-decision sequential binomial test, 358 
r, transformation of, 413 
ρ (α, β, n), 231, 232, 271, 278n., 279, 315 
s (slope of decision line, in graphical procedure 
for sequential tests), 233, 358, 422, 423, 425 
s or s², 12, 43, 162, 192 

sample size (see Formula, number of observa¬ 
tions needed) 

significance levels of Cochran’s test, 392 
of T-distribution, 155 
of t-distribution, 148 

standard deviation, of observed proportion, 
255, 397 


Formula, standard deviation, of mean, 58 
of s, in normal samples, 58 
of 6, 258 
of x̄, 58 

of x̄ + ks, in normal samples, 58 
(See also Formula, variance) 

Student’s t, 200 
Student’s z, 187n. 

T (angular difference between two binomial 
probabilities), 262, 264 
T (Hotelling’s), 142, 154 
T-statistics, 131, 133, 135, 145 
t (angular difference between two observed pro¬ 
portions), 264 
t (Student’s), 200 
non-central, 72, 73 
t (f, δ, ε), 74 

tolerance-limit factor (K), 101, 108 
two-point estimates of μ and σ from probit 
data, 350 

variances of, 351, 352 
transformation to stabilize variance, 409 

(See also Inverse-sine transformation; z- 
transformation) 

variance, of angular difference between two 
observed proportions, 264 
arithmetic mean, 162 

when data are grouped or rounded, 198 
binomial distribution, 397, 410 
correlation coefficient, 413 
“error” due to grouping or rounding, 195, 
218 

grouped or rounded variable, 195 
interpolated value, 219 
mean, 162 

when data are grouped or rounded, 198 
median, 162 
φ and φB, 400, 407 
Poisson distribution, 411 
probit, 350 
proportion, 397, 410 
r, 413 

s 2 , arbitrary population, 199 

when data are grouped or rounded, 199 
t (angular difference between two observed 
proportions), 264 
φ (transformed proportion), 400 
two-point estimates of μ and σ from probit 
data, 351, 352 
x̄, 162, 198 

variance estimates, 12, 43, 162, 192 
x̄, 12, 43, 100, 192 

yardsticks, for differences of means, 150 
Fourfold table (see Double dichotomy) 

Fraction defective, definition of, 10 
dependence upon μ and σ of items in lot, 10 
as measure of lot quality, 11, 42 

(See also Acceptance inspection, for fraction 
defective) 

Frankel, Lester, 149n., 155n., 184 
Freedom, degrees of (see Degrees of freedom) 
Freeman, Harold A., ix, 419n., 428 
Frequencies, expected, in 2 X 2 table, 253 
Frequency distributions (see Distribution) 





Friedman, Milton, ix, xi, xii, 6, 320, 340, 364, 
419n. 

Fry, Thornton C., 253n., 266 
Fulcher, John S., 257n., 265 
Fulcher-Zubin Item Analyzer, 257, 258 
Function, experimental determination of 
maximum of, 363-372 

G 

Γ: confidence coefficient associated with confidence 
belts for parameter of binomial distribution, 
335, 336 

γ: a confidence coefficient, 42, 43, 98, 100, 108, 
332, 336 

yd confidence coefficient used in judging whether 
to continue normal inspection, 44 
yd confidence coefficient used in judging whether 
to resume normal inspection, 44 
γ₁: Fisher’s measure of skewness, 197, 198 
γ₂: Fisher’s measure of kurtosis, 197, 198 
Gamma distribution (see Chi-squared distribution) 
Gaussian distribution (see Normal distribution) 
Geary, R. C., 201, 222 

General one-parameter tests, sequential sampling, 
426-428 

single sampling, 61, 62 

General transformation to stabilize variance, 409 
Generalized normal curve, 197 

(See also Gram-Charlier series) 

Generalized Student’s ratio (T), definition of, 142, 
154, 155 

distribution of, 154, 155 

asymptotic expansion for percentage points 
of, 155 

free from nuisance parameters, 115 
relation to χ²-distribution, 155 
to F-distribution, 155 
optimum properties of, 147 
as “score,” combining results of two or more 
tests, 113-116 
Generalized variance, 166 

Girshick, Meyer A., ix, xi, xii, 6, 226, 238n., 244, 
354, 419 

Glaisher, J. W. L., 191, 222 
Glossary for Chap. 1, 85-92 
Goldberg, Henry, x, 83n., 109, 149n., 184 
Gosset, William Sealy, 187 
(See also Student) 

Gram-Charlier series, 197 

Graphical procedure, for analyzing binomial data, 
260-262 

binomial-probability paper, reference to, 414 
nomogram for, 260 
Zubin’s, 398, 416 

for determining whether to abandon or com¬ 
plete an experiment, 356-359 
worked example, 359-361 
in sequential analysis, binomial test, one-deci¬ 
sion, 358 
one-sided, 422 

comparison of two proportions, 425 
double-dichotomy test, 425 
mean test, σ known, 423 
proportion test, 422 


Graphical procedure in sequential analysis, 
standard deviation test, μ unknown, 232, 233 
in variables inspection for fraction defective, 
34-37, 41 

adjustment constant (A), 36, 37, 40, 69-71 
optimum choice of, 41, 54, 70, 71 
advantages of, 36 

boundaries of acceptance and rejection re¬ 
gions, figures showing, 38-40 
formulas for, 35-37, 41 
derivation of, 66-71 

impossible region, indicating a mistake, 36, 37 
figures showing, 38-40 
when upper and lower limits on a single 
quality characteristic define acceptable 
quality, 52-56 

one two-sided criterion, 54-57 
pair of one-sided criteria, 53, 54 
Grouped data, impossibility of avoiding use of, 
187, 188, 189 

Grouping (of data), 187, 188, 191-215 

effect of, on descriptive statistics, 187, 188 
on distribution of F, 214, 215 
of s², 198-200, 203-214 
of t, 200-203, 214, 215 
of x̄, 196-198, 212, 213 
on estimation, of population mean, 197, 198, 
200-203 

population variance, 198-200, 203-214 
on moments, 193-196 

representation as effect of a random group¬ 
ing or rounding error, 195, 218 
on standard procedures of statistical infer¬ 
ence, 191-215 

with large samples, 191-203 
with small samples, 203-215 
impossibility of avoiding, 187, 188, 189 
reasons for, 187 

Grouping interval, recommended choice of size of, 
192, 214 

Grouping lattice, effect of position of, on distri¬ 
bution of s 2 , 206-213 
on distribution of x̄, 212, 213 
on estimation of μ and σ² by x̄ and s², respectively, 
212, 213 
random location of, 194 

extends validity of Sheppard’s corrections, 
196 

when only non-negative observations can 
occur, 196 

Groups, variation within and between, 312 
Grubbs, F. E., 318n. 

Gun barrels, pressure proofing of, 331-338 
H 

h: number of lots from which samples are taken 
and combined for computing process average, 
42 

h₁: negative of intercept of lower decision line 
(acceptance line) in graphical procedure for 
sequential tests, 426 

formula, for binomial test, one-sided, 422 
double-dichotomy test, 425 
mean test, σ known, 423 
h₁: negative of intercept of lower decision line 
(acceptance line) in graphical procedure for 
sequential tests, formula, standard deviation 
test, μ unknown, 233 

taken equal to infinity, to provide for rejection 
only, 358 

use in constructing five-point OC curve, 427 
h₂: intercept of upper line (rejection line) in 
graphical procedure for sequential tests, 426 
formula for, binomial test, one-decision, 358 
one-sided, 422 
double-dichotomy test, 425 
mean test, σ known, 423 
standard deviation test, μ unknown, 233 
use in constructing five-point OC curve, 427 
Hall, Philip, 218n., 222 
Hartley, H. O., 386n., 394 
Hastay, Millard W., iii, ix 
Hayashi, Keiichi, 402n., 416 

Histogram, representation of populations to which 
grouped data correspond, 193-195 
Homogeneity of binomial distributions, experi¬ 
ments for testing, 249-265, 424, 425 
analysis of data from, sequential sampling, 
424-425 

single sample, 249-258 
planning of, sequential, 424 
single sample, 258-265 
of means, 61-62, 385-386 
of proportions, 249-265, 424, 425 
of 2 X 2 table, 249-265 
of variance estimates, 385-389 

Bartlett’s test for, 385, 386, 387, 388 
Cochran’s test for, 386, 387, 388, 389 
significance levels of, tables of, 390, 391 
Homogeneous variance, advantages of, 397, 398 
tests for, 385-389 

transformation of data to achieve, general 
formula for, 408, 409 
examples, 410-413 

Hotelling, Harold, ix, xi, 112, 114n., 149n., 154n., 
155n., 172n., 184, 365n., 368n., 371n., 372, 398 
Hotelling’s T, definition of, 142, 154, 155 
distribution of, 154, 155 

asymptotic expansion for percentage points 
of, 155 

free from nuisance parameters, 115 
relation to χ²-distribution, 155 
to F-distribution, 155 
optimum properties of, 147 

as “score,” combining results of two or more 
tests, 113-116 
Hsu, P. L., 147, 166n., 184 

Hypotheses, statistical tests of (see Statistical 
tests) 

Hypothesis, alternative, 419, 420 
null, 420 

under test, 419, 420 

I 

Ignition current, 341 

Immediate acceptance, criterion for, in variables 
inspection for fraction defective, 34, 37, 41, 
67, 69, 70 


Impossible region, indicating a mistake, 36, 37 
figures showing, 38-40 

Inconsistent requirements in specifications, 50n. 
Independence of acceptance tests, 51 
tables for, 249-255 

(See also Double dichotomy; 2X2 tables) 
of x̄ and s in samples from a normal population, 
200 

effect of grouping or rounding on, 187, 201 
Indifference quality, definition of, 20 

use in variables inspection for fraction defec¬ 
tive, 21 

Indifference value, 282 

Inferences about population (see Estimation; 

Statistical tests) 

Information about lot, 7 

Inspection, acceptance (see Acceptance inspec¬ 
tion) 

amount of (see Amount of inspection) 
cost of (see Cost; Amount of inspection) 
curtailed, 14, 238, 361 
destructive, 9, 321, 331, 341 
normal, 43, 44 

plan, considerations involved and steps in 
choosing, 45-49 
reduced, 41-45 

sampling, effectiveness of, 156-158 

(See also Operating-characteristic curves) 
objections of, 156-158 
risks involved, 228, 229 

(See also Consumer’s risk; Producer’s risk) 
value of, dependent on quality of submitted 
product, 42 

for several quality characteristics, 50-52, 113- 
116 

tightened, 41-45 
variability of, 235-244 
by variables (see Variables, inspection by) 
Interpolated value, accuracy of, 219, 220n., 221n. 
error of, 219 

precision or variance of, 219-221 
Interpolation, 218-221 

coefficients, Lagrangian, 108, 109, 219, 222 
Salzer’s inverse, 109 
without differences, 219 
harmonic in k or n, 317, 388 
in Vn, 275, 316, 388 
inverse, 109 

probability wise, 316n., 318 
Interpreting, analysis-of-variance experiments, 
297, 310-314 

experiments for comparing two proportions, 
249-258 

for comparing two standard deviations, 267- 
318 

for determining whether variability of new 
process or product exceeds standard, 
267-278 

is less than standard, 278-280 
for determining whether one of two processes 
is more variable than another, 280-296 
experiments when severity of any one test 
cannot be controlled precisely, 331-338 
Inverse hyperbolic tangent, 413 (see z-transformation) 

Inverse-sine transformation, of proportions, to 
stabilize variance, 257-259, 261-264, 336, 
395-416 

adjustment of, 399, 401-407 
references to, 399n., 405n., 407n. 
table to facilitate, 406 

Bartlett’s adjustment of, 399, 401-407, 413, 414 
Bartlett’s form of, 398, 400, 401 
bias of, 403, 404 
bibliography for, 414-416 
derivation of, 410, 411 
effectiveness of, 401-408 
figures showing, 400-402 
tables showing, 403-405 
forms of, Bartlett’s, 398, 400, 401 
Bliss’, 398, 400 

Fisher’s original, 398, 399, 400 
SRG’s first v ision, 405, 406 
Yates’, 398, 400 
Zubin’s, 257, 258, 398, 405 
history of, 398 

nature of, 39t»-400 
presentation of final results, 407 
tables to facilitate application of (see Bibliography, 
414-416) 

use to determine confidence limits for binomial 
parameter, 336 

to determine number of trials needed in 
experiments for comparing two propor¬ 
tions, 259, 260 
formula for, 259 

derivation of, 261-264 
nomogram based on, 260 
use to determine OC curve of experiment for 
comparing two proportions, 261, 262 
use in sequential analysis, references to, 405n., 
415 

Irwin, J. O., 218n., 222 

Isaacson, E., 416 

J 

J-shaped curves, 194, 222 

Jennett, W. J., 8n., 49, 58n., 72n., 93, 99 

quotation from paper by, with B. L. Welch, 49- 
50 

Johnson, N. L., 8n., 22, 64, 72, 73, 74, 76, 77, 79, 
86, 93 

K 

K: difference between upper limit (U) specified 
for a variable and its population mean (μ), in 
units of population standard deviation (σ), 
9-13 

K: minimum proofing pressure prescribed for gun 
barrels, 331 

K: multiple of s used in setting tolerance limits, 
97-100 

formulas for, 101, 108 
table of values for, 102-107 
uses of, examples of, 100, 101 


Kα: normal deviate exceeded with probability α 
(producer’s risk, or probability of an error of 
first kind), 17, 278n. 
formula for, 263 

relation of, to percentage points of χ²-distribution, 
258n. 

use in acceptance inspection, for average 
performance, 61, 62 

for dispersion of performance, 231, 232, 278n. 
for fraction defective, by attributes, 259, 
260n., 261-264 

by variables, 17-21, 60-63, 81, 82 
for fraction of sample ranges less than 
specified limit, 231, 232 
for mean quality of lot, 61, 62 
for standard deviation of quality in lot, 231, 
232, 278n. 

use in formula for critical level (A) and number 
of observations needed in general single¬ 
sample, one-parameter test, 61, 62 
use in planning experiments for comparing two 
proportions, 258-264 

for comparing two standard deviations or 
variances, 278n. 

use in probability-wise interpolations in percentage-point
tables, 316n.
use in variables inspection for fraction defective,
17-21, 60-63, 81, 82
(See also Kε)

Kβ: normal deviate exceeded with probability β
(consumer's risk, or probability of an error
of second kind), 17, 278n., 380
formula for, 263, 264 

use to determine number of observations needed 
when decisions will be based on signs of 
differences between sample means, 380 
(See also Kα, uses for; Kε)

Kε: normal deviate exceeded with probability ε,
17, 380

formula for, 17, 18, 31, 260 

relation to “probit,” 343 

tables of, references to, 10n., 18, 343n. 

need for care in using, 10n.
use in asymptotic expansion for percentage
points of t-distribution, 148
uses, other (see Kα and Kβ; K1-γ; K0; K1; K2)

K1-γ: normal deviate exceeded with probability
1 - γ, 336

K0: normal deviate exceeded with probability
p0, 21

K1: normal deviate exceeded with probability
p1, 17

K2: normal deviate exceeded with probability
p2, 17

k: coefficient of s in variables inspection plan for 
fraction defective, 12-14, 16-26 
adjustment of, when N is fixed, 19, 20
approximate evaluation by formula, 17-20 
comparison with exact values, 64-66 
exact evaluation, from indifference quality, 21 
using non-central t-distribution, 71-82
formulas for, 17-20 
derivation of, 59, 60, 62-63 
table of values, 22-25, 65 
k: coefficient of s in variables inspection plan for
fraction defective, uses of, additional, 27n., 
28n. 

k: number of items the range ( Rk ) of which is 
critical, 227, 228 

k: number of samples to which a given acceptance 
number ( Ak ) or rejection number (Rk) 
corresponds, 237 

(See also Acceptance number; Rejection 
number) 

kA : acceptance level for ko in sequential variables 
inspection for fraction defective, 84, 85 
k0: observed difference between upper limit (U)
and sample mean (x̄) in units of s; sample
estimate of K, 43

use in sequential variables inspection for 
fraction defective, 84, 85 
kR: rejection level for k0 in sequential variables
inspection for fraction defective, 84, 85
k': √n times coefficient of s in acceptance
criterion, used when items are eventually
assembled in sets of n, 48

Kelley, Truman Lee, 18, 93 

Kendall, Maurice G., 120n., 184, 196n., 199n., 222 
Kohn, Anne Frankenthaler, x 
Kurtosis, measures of (β2, γ2), 197, 198

of distribution of error of an interpolated value, 
221 

of sample mean when data are grouped or 
rounded, 197, 198 

L 

L: upper limit of a measured quality character¬ 
istic, above which an item is defective, 72 
(See also U) 

Lp: probability that a lot of fraction defective p
will be accepted by a given acceptance
inspection plan if submitted, 15, 16, 28
evaluation of, 28-33, 50-52, 76, 77, 80, 358, 361, 
427 

use of, in determining confidence limits for p, 
43-45, 336 

Lφ: probability of concluding that σ2² < σ1² when
σ2² = φσ1², 282

curves showing, for F-tests of variance estimates,
296, 297, 318n.

evaluation of, for F-tests of variance estimates, 
282-296, 316, 317 

Lρ: probability of concluding that σ1² < σ0² when
σ1² = ρσ0², 278

curves showing, for χ²-test of variance estimate,
274, 275, 318n. 

evaluation of, for sequential sampling (see Lσ)
for single-sample χ²-test, exact procedure,
271-278, 279, 315
large-sample formula for (see Lσ)

Lσ: probability of concluding that σ1 < σ0 when
σ1 = σ, 232

curve showing, for sequential test, 234 
evaluation of, for sequential sampling, 233 
for single-sample χ²-test, exact procedure, 232
large-sample formula for, 232 


LT: probability of concluding that PE > PS when
true angular difference between PE and PS
is T, 262

curve showing, 262 
evaluation of, by formula, 262 
from nomogram, 263 

Lθ: probability of concluding from likelihood ratio
sequential test that θ < θ0, as a function of
the true value of θ, 427, 428
Lμ,σ: probability that a lot of mean μ and standard
deviation σ will be accepted by a given acceptance
inspection plan if submitted, 16, 52
Lagrangian interpolation coefficients, 108, 109, 
219, 222 

λ: natural logarithm of ratio of two probabilities
or probability densities, computed after each
observation in sequential analysis, 83
λ(f,t0,ε): function of f, t0, and ε used in calculation
of δ(f,t0,ε) and t(f,δ,ε) from tables of non-central
t-distribution, 74, 75
λ1: value of λ(f,t0,ε) computed in determining
p1-α, 82

λ2: value of λ(f,t0,ε) computed in determining pβ,
82

Landau, E., 408n. 

Langdon, W. H., 195n., 196n., 222 
Large samples, formulas and procedures valid for, 
17-21, 30-33, 43, 58-63, 81, 82, 101, 108, 109, 
148, 149n., 155, 162-163, 169, 231, 232, 263,
264, 278n., 336, 377-381, 393, 399-401, 407, 
408-413 

accuracy of, 64, 65, 401-407
when data are grouped or rounded, 196-203 
Least squares, application of, when cost of each 
measurement depends upon its magnitude, 
319-328 

when data are observations on cumulative distribution,
343, 344
references to, 343n., 352 
Lethality of drugs, 341 
Level of significance, 270, 271, 281 
Levine, Harriet, x, 6, 109, 149n., 184 
Levine, Myra, x, xi, 236 

Likelihood ratio, as basis for sequential tests, 83, 
419-421 

definition of, 420 

formula for logarithm of (X), in sequential bi¬ 
nomial test, 421 

in sequential double-dichotomy test, 424 
in sequential mean test, σ known, 423
σ unknown, 84

in sequential variables inspection for fraction 
defective, 84 

Limits, confidence (see Confidence limits) 
control (see Control limits)
of normal variation (see Tolerance limits) 
tolerance (see Tolerance limits) 

Lindley, D. V., 281n.

Linear regression, when cost of each measurement 
depends upon its magnitude, 319-328 
when data are observations on cumulative dis¬ 
tribution, 343, 344 
references to, 343n., 352
Location, estimation of parameters of (see Estimation;
Mean; Median)

Logarithmic scale, 321 
Logarithmic transformation, 412-413 
(See also z- transformation) 

Logarithms, of factorials, 253
reference to tables of, 253n.

Lot, acceptability of, determined by average 
quality, 8, 27, 28, 61-63 
by dispersion of quality, 8, 27n., 28n., 229, 
269, 270, 278-280, 283 
by fraction defective, 7, 8 
by fraction of averages of samples of n that 
exceed specified limit, 48 
by fraction of ranges Rk in samples of k that
exceed specified limit, 228, 229 
formulation of, in terms of σ, 229-231
by mean quality (μ), 27, 28, 61-63
by mean (μ) and standard deviation (σ) of
quality, 11

by several quality characteristics, 50-52, 113,
114

by standard deviation of quality, 8, 27n.,
28n., 229, 269, 270, 278-280, 283
by upper and lower limits on a single quality
characteristic, 52-56, 63, 64
(See also Acceptable quality; Acceptable-quality
level)

acceptance of, meaning of, 13, 14

procedure for (see Acceptance inspection) 
judgment of, from a sample or samples (see 
Acceptance criterion; Acceptance inspec¬ 
tion) 

quality of, choice of standards for, 46-49
specification of (see Lot, acceptability of) 
rejection of, meaning of, 13, 14 
submitted, probability of acceptance (see 
Operating-characteristic curve) 
variation of, from item to item, 8 
Lot tolerance percent defective (LTPD; p2), definition
of, 45, 229

difference between true and intended, in varia¬ 
bles inspection for fraction defective, 21-26 
causes of, 21, 22 
discussion of, 21, 22, 25, 26 
table showing, 22-25 
selection of, considerations in, 45-49 
Lowan, Arnold N., x 

(See also Mathematical Tables Project) 
LTPD (see Lot tolerance percent defective) 

M 

M: minimum value of a measured quality characteristic,
below which an item is defective, 13
m: two-point estimate of μ based on probit data,
350

formula for, 350 
variance of, 351 

with optimum arrangement, 352 
Marginal totals, held fixed in analysis of 2 X 2 
table, 251 
Martin, Celia S., x 

Massachusetts Institute of Technology, ix 


Mathematical Tables Project, x, 10n., 31n., 83n.,
93, 108, 109, 209n., 219n., 222, 255n., 260n.,
265

Mather, K., 414n. 

Matrices, numerical inversion of, 172-173 
Maximum of a function, 365 
design of experiments for determining, 363-372 
factorial, 366, 367, 369-372 
when form of function is known, 371 
sequential, 367-372 
single sample, reference to, 365n., 372
use of preliminary sample in, 371 
Mean, arithmetic (see Arithmetic mean; Sample 
mean) 

of binomial distribution, 397, 410 
of grouped distribution, 193, 194
of lot (μ), as measure of lot quality, 8, 27, 28,
61-63

of normal distribution, 9, 12, 61, 97, 161, 197, 
198, 342, 378, 423 

variance of maximum likelihood estimator 
of, when data are grouped, 197 
of Poisson distribution, 411 
of population (μ), bias of sample mean as
estimator of, when data are grouped or 
rounded, 187, 188, 212, 213 
confidence limits for, σ known, 197, 198
σ unknown, 99, 200-203, 213-215
decisions regarding, based on sign of differ¬ 
ence between sample means, 377-382 
estimation of, by sample mean, 162, 187, 188, 
192, 196-198, 203, 212, 213 
from observations on cumulative distribu¬ 
tion, 342-352 

optimum design of experiments for, 
345, 346, 349-356 

importance of distinguishing from sample
mean (x̄), 12
as parameter, 12, 161 

tests of significance regarding, sequential, σ
known, 423

σ unknown, 83-85, 426
single sample, σ known, 61, 62

when data are grouped or rounded,
197, 198, 212, 213
σ unknown, 61-63, 71-73

when data are grouped or rounded, 
200-203 

variance of maximum likelihood estimator of, 
when data are grouped or rounded, 197
of r, 413 
of s, 58
of s², 192, 412

when data are grouped or rounded, 199, 212, 
213 

of sample, as estimator of population mean, 12, 
162, 187, 188, 192, 196-198, 203, 212, 213 
bias of, when data are grouped or rounded, 
187, 188, 203, 212, 213 
(See also Expected value) 

Mean deviation (mean absolute deviation), 
definition of, 162 
as estimator of <r, 162, 163, 169 
Mean error (see Mean deviation) 
Mean squares, in analysis of variance, 312
Measurement, errors of, 188, 189 
Median, of sample, 9 

as estimator of population median, 162 
Merrington, Maxine, 148n., 151n., 152n., 184, 316,
317, 318, 388n., 393, 394 
Metallurgical experiments, 321, 365 
design of, when cost of each measurement 
depends upon its magnitude, 322-328 
when object is to determine composition 
yielding maximum performance, 363-372 
Molina, E. C., 109 
Moments, about origin, 193 

effect of grouping or rounding on, 193, 194 
about mean, 195 

effect of grouping or rounding on, 195, 196 
average, when grouping lattice is located at 
random, 194, 195 
centroidal, 196 

effect of grouping or rounding on, 195, 196
of distribution of s², when data are grouped or
rounded, 199

of sample mean, when data are grouped or
rounded, 198

of t , when data are grouped or rounded, 201, 
202 

of grouped distributions, 193-196 

average, when grouping lattice is located at 
random, 194, 195 

effect of location of rounding lattice on, 194, 
195, 196, 206-213 
ideal, 193 
raw, 193 
realistic, 193 

Mosteller, Frederick, 238n., 244, 414n. 

μ: mean of lot or population (see Mean, of lot; of
population) 

Multiple of s, for setting tolerance limits (see K)
in variables inspection for fraction defective 
(see k) 

Multiple of σ, determined by AQL and U, 11
determined by difference between U and μ, 9-13
determined by proportion of population to be 
included between tolerance limits, 97 
Multiple sampling, attributes inspection for frac¬ 
tion defective, 237-244 

average amount of inspection (ASN), com¬ 
pared to double and sequential sampling, 
238-244 

computational methods, references to, 244 
figures showing, 240, 242 
characterized by relation of Rk to Ak, 237
variability of amount of inspection σ(N), compared
to double and sequential sampling,
238-244 

computational methods, references to, 244 
figures showing, 241, 243 
Multivariate quality control, applied to air testing 
of bombsights, 113-184 
analysis of certain test bombings, 168-183
arrangement of bombings, 121, 122 
control charts, 122-142 

actions, to be taken when “out of control," 

check list of, 140


Multivariate quality control, control charts, 
figures showing, 124, 126, 128, 130, 132, 
134, 136 

formulas and procedures for constructing, 
147-156 

illustrations of use of, 135-142 
revision of, considerations affecting and pro¬ 
visions for, 142-147 

simultaneous use of several sample criteria, 
50-52, 113, 114 

use of single “score" or “index," 51, 113, 114 
based on generalized Student’s T, 114-116 

Muzzle velocity, tolerance limits for, 100, 101 

N 

N: sample size needed, in analysis-of-variance
tests, 311-314 

in attributes inspection, 259, 260n., 263, 264 
when severity of any one test cannot be con¬ 
trolled precisely, 331-338 
for binomial test, one-sided, 259, 260n., 263, 264 
for χ² test of variance estimate, 231, 274, 275,
278 

for determining whether level of performance of 
a process or product exceeds standard, 17, 
28n., 59-63, 81 

for determining whether variability of per¬ 
formance of a process or product exceeds 
standard, 231, 274, 275, 278
is less than standard, 278-280 
for fraction defective tests, based on attributes, 
259, 260n., 263, 264 
based on variables, 17 

when μ is known and σ unknown, 27n.,
28n., 231, 274, 275, 278n.
when μ is unknown and σ known, 27n.,
28n., 61, 62

when μ and σ are both unknown, 17-21,
27n., 28n., 59-66, 81, 82
in general one-parameter tests, 61, 62
for mean test, σ known, 27n., 28n., 61, 62
σ unknown, 17, 27n., 28n., 59-63, 81, 82
for proportion test, 259, 260n., 263, 264 
when range in samples of k determines lot 
quality, 231 

for standard deviation tests, 231, 274, 275, 
278-280 

in variables inspection for fraction defective, 
17-21, 27n.-28n., 59-66, 81, 82, 231, 274, 
275, 278n. 

N: size of each of two samples needed, for comparison
of two means, σ known, 61-62, 380
σ unknown, 17, 27n., 28n., 59-63, 81, 82
for comparison of two proportions or percent¬ 
ages, 259, 260, 263, 264 
nomogram showing, 260 
for comparison of two standard deviations or 
variances, 280, 282, 283, 296 
when decisions are based on signs of differences 
between sample means, 380 
for determining whether level of performance of 
one process or product exceeds that of 
another, 17, 27n., 28n., 59-63, 81, 82, 380
N: size of each of two samples needed, for determining
whether variability of performance of
one process or product exceeds that of
another, 280, 282-283, 296
for difference of means test, σ known, 61, 62,
380

σ unknown, 17, 27n., 28n., 59-63, 81, 82
for difference of proportions or percentages
test, 259, 260, 263, 264
nomogram showing, 260 
for double-dichotomy test, 259, 260, 263, 264 
nomogram showing, 260 
for F-tests of variance estimates, 280, 282, 283, 
296 

N̄: average number of observations needed, 233,
238, 420, 427 
(See also ASN curve) 

Na: sample size needed with single-sampling attributes
inspection plan, 26, 27, 259, 260n., 263,
264

Ni: number of observations in i-th sample of a
group of k samples, 43

Nμ: size of sample needed with variables inspection
plan for fraction defective when σ is
known, 27, 28n., 61, 62

Nσ: size of sample needed with variables inspection
plan for fraction defective when μ is
known, 27n., 28n., 231, 274, 275, 278
Nt: size of sample needed with variables inspection
plan for fraction defective when μ and σ
are both unknown, 17-21, 26-28, 27n., 28n.,
59-63, 81, 82

N number of sets of n items each in a sample, 48 
n: degrees of freedom, in analysis of variance, 313 
of χ², 231, 270, 316, 387-389
of s or s², 150, 152, 200, 203, 231-232, 387, 388-389

of T, 145, 155 
of t, 148, 150 

n: number of items constituting a set which is 
judged defective or nondefective on the basis 
of the arithmetic mean of the set, 48 
n: number of variables upon which the quality of 
an item depends, 50 

n: size of preliminary sample in attributes inspec¬ 
tion for fraction defective when severity of 
any one test cannot be controlled precisely, 
331, 336-338 
N — 1 versus N , 13, 58 
Nair, U. S., 386n., 393 

National Bureau of Economic Research, iii, ix 
National Bureau of Standards, iii, ix-xii, 93, 109, 
186, 222, 265, 268, 376, 384, 396 
(See also Mathematical Tables Project) 
National Defense Research Committee, vii 
Applied Mathematics Panel, vii, x 
Navy, viii 

New method, process, or product, compared with 
old, with respect to fraction defective, 249-264,
424, 425

with respect to level of performance, 59-63, 
380 

with respect to variability of performance, 
280, 282, 283, 296 


New method, process, or product, compared with
standard, with respect to fraction defective,
7-93, 27n., 28n., 231-234, 237-244, 270,
271, 274, 275, 278-280, 423
with respect to level of performance, 17, 27n., 
28n., 59-63, 81, 82, 423 
with respect to variability of performance, 
227-234, 270, 271, 274, 275, 278-280 
New York Mathematical Tables Project 
(NYMTP) (see Mathematical Tables Project) 
Newton’s method, 109 
Neyman, Jerzy, 271n., 318 
Nomogram, for analyzing binomial data, 260 
Zubin’s, references to, 398, 416 
Nomograph (see Nomogram) 

Non-central t, 8n., 22, 64-66, 71-82, 83n., 85n.
definition of, 71-73 
distribution of, 72, 73

Johnson and Welch tables of, description of,
73, 74 

notation used in, 72, 73 
procedures for using, 74-76 
relation to Student’s t, 72 

use in sequential variables inspection for frac¬ 
tion defective, 83n., 85n. 
use in single-sample variables inspection for 
fraction defective, 64-66, 71-82 
for evaluating k, procedure for, 76, 81, 82 
worked example, 77-79 
work sheets, 77-79 
for evaluating N, procedure, 81, 82 
for evaluating points on OC curve, procedure, 
76, 77 

worked example, 80 
work sheet, 80 
Nondefective, 7 

Non-normal distributions, 50, 51, 99, 377, 378,
381, 382 

best discriminant functions for discriminating 
between, 381, 382 
tolerance limits for, 99, 109 

(See also Normality assumption) 

Normal deviates, exceeded in absolute value with 
given probability, 148 

relation to percentage points of χ²-distribution,
258n.

tables of, references to, 10n., 18n., 148n., 149n. 
exceeded with given probability (see Kα, Kβ,
K1-γ, K0, K1, and K2)
relation to probits, 343 
short table of, 18 

tables of, references to, 10n., 18, 343n. 
need for care in using, 10n.
probabilities corresponding to given, short 
table of, 31 

tables of, references to, 10n., 31 
need for care in using, 10n.

Normal distribution, 8, 9-13, 17, 18, 21, 31, 49, 
50, 58, 61, 62, 64, 73, 83, 97, 98, 100-109,
161-163, 197-214, 221, 229, 255, 263, 264, 
312, 336, 341-344, 350, 377-381, 412, 413, 
423, 426 

(See also Normal deviates; Normality assumption;
Normally-distributed test function)
Normal distribution, basis for “probits,” 343
bias of mean deviation as estimator of standard 
deviation of, 169n. 

bias of s as estimator of standard deviation of,
58, 162

adjustment factor, formula for, 58 
tables of, references to, 58n. 
bias of s² as estimator of variance of, when data
are grouped or rounded, 199 
in biological assay, 342 
bivariate, 146 

correlation coefficient of, 413 
relation of T to exponent of, 114-115 
characteristics of, 9-13, 97, 98, 161-163, 198, 
199, 200, 202, 221, 229, 312, 378, 379, 381 
cumulative, estimating μ and σ from data on,
339-352 

optimum design of experiments for, 345, 346, 
349-352 

departures from, seriousness of, in statistical 
analyses, 49, 50, 99, 198, 199, 202, 346, 
377, 381 

dependence of proportion exceeding given 
limit on the parameters of, 10 
(See also Normal deviates) 
in dosage-mortality studies, 342 
of errors of tabular values, advantages that 
would accrue from, 221 
estimation of the parameters of, 12, 58, 161- 
163, 169 

from grouped or rounded data, 187, 188, 
196-203, 212-214 

from observations on cumulative distribution,
339-352

expected value, of mean deviation in large
samples from, 169 

of s in samples from, formula for, 58
of x̄ + ks in samples from, 58
figure, schematic, 10 
formula for, 161, 423 

for partial integral of, 17, 18, 31, 260, 263 
generalized, 197 

(See also Gram-Charlier series) 
grouped, distribution of s² in samples from,
198-200, 203-214

of t in samples from, 200-203, 214, 215
of x̄ in samples from, 196-198, 203, 212, 213
independence of x̄ and s² in samples from, 200
effect of grouping on, 187, 188, 201
kurtosis, measured by β2 or γ2, 197, 198

of distribution of means from, when data are 
grouped or rounded, 197, 198 
mean of (μ), 9-12, 61, 97, 161, 162, 197, 198,
342, 378, 423 

estimation of, from observations on cumulative
distribution, 342-352
from probit data, 342-352 
by sample mean, 162, 187, 188, 192, 196- 
198, 203, 212, 213 
by sample median, 162 
tests of significance regarding, sequential, σ
known, 423
σ unknown, 83-85, 426


Normal distribution, mean of (μ), tests of significance
regarding, single-sample, σ known,
61, 62, 197, 198, 212, 213
σ unknown, 61-63, 71-73, 200-203
variance of maximum likelihood estimator of,
when data are grouped or rounded, 197 
parameters of, estimation of (see Normal distri¬ 
bution, estimation of parameters of; mean 
of; standard deviation of; variance of) 
notation for, emphasizing distinction from 
sample estimates, 12 

tests of (see Normal distribution, tests of 
significance regarding parameters of; mean 
of; standard deviation of; variance of) 
range in samples of k, cumulative distribution, 
figure showing, 230 
tables of, references to, 229, 230, 234 
distribution of, 234 

depends only on k and σ, 229
as estimator of σ, 227
mean values of, 227n., 234 
standard deviation of, 227n., 234 
standard deviation of (σ), 9-12, 97, 161, 229,
342, 426 

estimation of, from mean deviation, 162, 163, 
169 

from observations on cumulative distribution,
342-352
from probit data, 342-352 
from range, 227 
from s, 12, 58 

(See also Normal distribution, variance 
of) 

tests of significance regarding, sequential, 
232, 233, 426 

single-sample, 200, 231, 270, 271, 274, 275, 
278-280 

standard deviation of s in samples from, 58 
of x̄ + ks in samples from, 58
tables of, 18, 31 

need for care in using, 10n.

references to, 10n., 18, 31, 148n., 149n., 343n. 

types of, 10n.

tests of significance regarding parameters of, 
sequential, 83-85, 232, 233, 423, 426 
single-sample, 61-63, 71-73, 197, 198, 200- 
203, 212, 213, 231, 270, 271, 274, 275, 
278-280 

(See also Normal distribution, mean of; 
standard deviation of; variance of) 
variance of, 161, 198, 199, 200, 212-214 
definition of, 161 

estimation of, by mean deviation, 162, 163, 
169 

by s², 162, 198-200, 208-214
tests of significance regarding, sequential, 
232, 233, 426 

single sample, 200, 231, 270, 271, 274, 
275, 278-280, 282, 283, 296 
(See also Normal distribution, standard deviation
of)

Normal inspection, 43-45 
acceptable-quality level for, choice of, 45-49 
Normal law (see Normal distribution)

Normal-probability function, tables of, 10n., 18, 
31, 148n.-149n., 343n. 
need for care in using, 10n.
types of, 10n.

Normal-probability paper, 414 

Normal surface (see Normal distribution, bivariate)

Normality assumption, avoided by use of attributes
inspection, 8, 229n.
by use of distribution-free tolerance limits, 
99, 109 

dependence upon, of analysis of probit data, 346 
of moments of t-distribution, 201
of OC curve for variables inspection plan for
fraction defective, 49, 50 
of tolerance-limit factor K, 99 
of x̄1 - x̄2 as best decision function regarding
sign of μ1 - μ2, 381

importance of, in air testing of bombsights, 141
in analysis of positivity data, 346
in evaluating probability that sample means
will be in opposite order to population
means, 378
in setting tolerance limits, 99 
in variables inspection for fraction defective, 
49, 50 

use of, when severity of any one test cannot be 
controlled precisely, 336 

Normally distributed test function, derivation of 
critical level (A) and number of observations 
needed (N) for, 61, 62 

Notation, as means of emphasizing distinction 
between parameters of population and 
sample estimates of these, 12 
used in variables inspection for fraction defective,
85-96

Nuisance parameters, 115 

Null hypothesis, 420 
in analysis of variance, 311, 312 
in mean tests, 386, 423 

in proportion tests, 249, 251, 264, 357, 358, 421, 
424 

in standard deviation or variance tests, 270, 
278, 281, 386, 387, 311, 312

O 

OC (see Operating Characteristics) 

OC curves (see Operating-characteristic curves) 

OC surface (see Operating-characteristic surface) 

Office of Scientific Research and Development 
(OSRD), vii 

(See also National Defense Research Committee)

Omnibus tests, 386 

Old method, process, or product compared with 
new, with respect to fraction defective, 
249-264, 424, 425 

with respect to level of performance, 59-63, 
380 

with respect to variability of performance, 
280, 282, 283, 296 
(See also Standard)


One-decision sequential binomial test, 358 
One-sided criteria, use of a pair of, in variables 
inspection for fraction defective, 53, 54 
compared to use of a single two-sided criterion, 
52, 56 

figure showing acceptance and rejection regions 
in (x̄, s)-plane, 55

One-sided or one-tail tests, of statistical hypotheses,
52-57, 150, 152, 156, 426
Operating characteristics (OC), 15, 16, 28-32, 
261-263, 271, 278 
definition of, 14, 261, 271 
Operating-characteristic curves (OC curves), 15, 
29, 234, 263, 274, 275, 296, 297, 379
for analysis-of-variance tests, 313, 314
for attributes inspection, sequential sampling,
426, 427

single sample, 260n., 262, 263
when averages in samples of n determine lot 
quality, 48 

for binomial test, one-decision (Rp), sequential,
358 

one-sided, sequential, 426, 427 
single sample, 260n., 262, 263 
truncated, sequential, 361 
for χ²-test of variance estimate (Lρ, Lσ), 231,
232, 271-279, 315, 318n. 
figures showing, 274, 275, 315n. 
large-sample formula for, 232 
tables for constructing, 272, 273, 276, 277 
when decisions are based on signs of differences 
between sample means [P(δ,N)], 378-380
figure showing, 379 
table for constructing, 378 
definition, 15, 16, 28, 41, 42, 232, 278, 358, 427 
detailed, for single-sample variables inspection 
plan for fraction defective, determining, 
30-33 

determines what fraction of production submitted
will be accepted, 47
as determining factor in cost of quality, 46 
for difference of means test, σ known, 61, 62
σ unknown, 30, 59-61, 62, 63, 71-73, 76, 77,
80 

for difference of proportions test, sequential,
424, 426, 427 

single sample (LT), 262, 263
figure showing, 262 

for double-dichotomy test, sequential, 424, 426, 
427 

single-sample (LT), 262, 263
figure showing, 262 

for double-sampling plans, reference to, 238 
effects of changes in acceptance criterion on,
21, 45 

of changes in sample size, 21, 45 
of reduced inspection, 45 
of simultaneous use of several acceptance- 
rejection criteria, 50-52, 113, 114 
of tightened inspection, 45 
for experiments for determining whether 
fraction defective of one process or product 
exceeds another, sequential, 424, 426, 
427 
Operating-characteristic curves (OC curves), for
experiments for determining whether
fraction defective of one process or
product exceeds another, single-sample
(LT), 262, 263
figure showing, 262 

whether level of performance of one process 
or product exceeds another, 30, 59-63, 
71-73, 76, 77, 80 

whether variability of performance of one 
process or product exceeds another (Lφ),
281-283, 296, 316, 317, 318n. 
figures showing, 296, 297, 318n. 
tables for constructing, 284-295, 298-309 
whether variability of performance of one 
process or product exceeds standard 
(Lρ, Lσ), sequential, 233
single sample, 231, 232, 271-278, 315, 
318n. 

figures showing, 234, 274, 275, 315n.
large-sample formula for, 232 
tables for constructing, 272, 273, 276, 
277 

whether variability of performance of one 
process or product is less than standard, 
279 

for F-tests of variance estimates (Lφ), 281, 283,
296, 313, 314, 316, 317, 318n.
figures showing, 296, 297, 318n. 

tables for constructing, 284-296, 298-309 
figures showing, 15, 29, 234, 263, 274, 275, 296,
297, 379

references to, 238n., 318n.
five-point, 28-30, 427 
figure showing, 29 

for fraction defective tests (see Operating-characteristic
curves, attributes inspection;
variables inspection)

for general one-parameter tests, sequential
(Lθ), 426-427
single sample, 61, 62 

information provided by, 15, 16, 28, 41, 42, 278, 
358, 427 

interpretation of, 15, 16, 28, 41, 42, 232, 278, 
358, 427 

location of, 21, 45 

for mean test, single sample, σ known, 61, 62
σ unknown, 30, 59-63, 71-73, 76, 77, 80
of multiple-sampling plans, references to, 238 
steps in computation of, outlined, 361 
for one-decision sequential binomial test (Rp),
formula for, 358 

ordinate of, 15, 16, 28, 41, 42, 232, 278, 358, 427
for proportion test, sequential sampling, 426, 
427 

single sample, 260n., 262, 263
when range in samples of k determines lot
quality (Lρ, Lσ), sequential sampling, 233
figure showing, 234 

single sample, 231, 232, 271-278, 279, 315, 
318n.

figures showing, 274, 275, 315n. 
for sequential-sampling plans, 233, 358, 361,
426, 427 


Operating-characteristic curves (OC curves), for 
sequential-sampling plans, general procedure
for constructing one-parameter tests
(Lθ), 426, 427
five-point, 427 

for one-decision tests (Rp), 358 
for specific tests (see the test concerned) 
for truncated tests, outline of steps for computing,
361

when several acceptance-rejection criteria are 
used simultaneously, 50-52, 113, 114 
for single-sample inspection plans and tests (see 
the test concerned) 

as specification of what will be accepted, 46 
for standard deviation tests (Lφ, Lρ, Lσ), sequential,
233

figure showing, 234 

single sample, 231, 232, 271-278, 279, 281- 
283, 296, 315-317, 318n. 
figures showing, 274, 275, 296, 297, 315n. 
large-sample formula for, 232 
tables for constructing, 272, 273, 276, 277, 
284-296, 298-309 
steepness of, 21, 45 
typical, figure showing, 15 
when upper and lower limits are imposed on a 
single quality characteristic, 52 
use of, to determine what fraction of production 
submitted will be accepted, 47 
use in selecting plan of action, 15 
for variables inspection plans for fraction defective,
15, 16, 21, 28-33, 41, 42, 45-50, 52,
65, 66, 71-73, 76, 77, 80 
when averages in samples of n determine lot 
quality, 48 

computation of, from large-sample formula, 
30-33 

from non-central t distribution, 71-73, 76, 
77, 80 

dependence on normality assumption, 49, 50 
effect of changes in N or k on, 21, 45 
figure showing, 29 
large-sample formula for, 30 
plotted as function of K = (U - μ)/σ, 49
Operating-characteristic surface, 16, 52, 261, 262 
Order of sample means, number of observations 
needed when decisions are based on, 380, 381 
optimum decision function, 381, 382 
probability opposite to population means, 377- 
380 

figure showing, 379 
formula for, 379 
table giving, 378 
Ordered magnitudes, 367, 368n. 

Ore, Oystein, 195n., 196n., 222 
Origin, shifting, to simplify graphical procedure 
for variables inspection, 36, 37, 41, 54, 69-71 

P 

P: parameter of binomial distribution, 250, 251
P : proportion of items with given attribute in 
infinite population, 255, 256, 397, 410
P: proportion of population or lot to be included
between tolerance limits, 47n., 98, 99
P : true long-run probability of “success” in any 
single trial, 250, 251 
(See also p)

PE: value of P for experimental or new method,
process, or product, 259

PS: value of P for standard or old method, process,
or product, 259

Po: probability or probability density corresponding
to observations in hand, if null hypothesis
is true, 420

as denominator of likelihood ratio, 420 
use of logarithm of, to simplify sequential 
analysis, 421 

Pi: probability or probability density corresponding
to observations in hand, if alternative
hypothesis is true, 420
as numerator of likelihood ratio, 420
use of logarithm of, to simplify sequential
analysis, 421

P(δ,N): probability sample means based on N
observations will be in opposite order to
population means when latter differ by δ
standard deviation units, 378, 379
figure showing, 378 
table of, 379 

PA: process average percent defective, 42 
p: fraction defective in lot, 15 
p: observed proportion, 255, 397, 410 
p: parameter of binomial distribution, 421 
p: true long-run proportion of “successes” with a 
given method, 421 

p1−α: the fraction defective for which the probability
of acceptance is 1 − α with a given
sampling plan, 82

pβ: the fraction defective for which the probability
of acceptance is β with a given sampling plan,
82

pE: observed proportion of successes with experimental
method, 256

pS: observed proportion of successes with standard
method, 256
po: indifference quality, 20, 29 
po: value of p when null hypothesis is true, 421 
pi: acceptable-quality level, 16, 229, 357, 359 
(See also Acceptable-quality level) 
pi: value of p for first of two methods, processes or 
products, 424 

pi: value of p when hypothesis alternative to 
p -> po is true, 421 

ps: lot tolerance percent defective, 16, 229, 357, 
359 

(See also Lot tolerance percent defective)
ps: value of p for second of two methods, 424 
p': fraction defective for which probability of
acceptance (Lp') is 1 − Lp, 32
Pairman, Eleanor, 194n., 222 
Parameter, of binomial distribution (P,p), 250, 
251, 421 

of Poisson distribution, 411, 426 
of population, distinguished from statistic and 
sample estimate, 12, 161 
Park, R. H., 335n. 


Paulson, Edward, ix, xi, 99, 109, 248, 393, 394, 
419n. 

Peach, Paul, 48n., 93 
Pearse, G. E., 194n., 223 

Pearson, Egon Sharpe, 166n., 184, 229, 230, 234, 
271n., 318, 335n., 386n., 389n., 393, 394 
quotation from article by, on “omnibus” tests, 
386 

quotation from letter of, on accuracy of con¬ 
fidence belt charts, 335 
Pearson, Karl, 194n., 222, 223 
Pegram, George B., x
Peifier, A. M., 149n., 184 
Percent defective (see Fraction defective)
Percentage points of statistical test functions (see
Significance levels)

Percentages (see Proportions)

φ: transformed proportion, 398, 399

(See also Inverse-sine transformation)

φB: transformed proportion with Bartlett’s
adjustment, 401

(See also Bartlett’s adjustment; Inverse-sine
transformation)

φ: the ratio of two variances (σ1²/σ2²), 281
φ(α,β,n1,n2), formula for, 282

derivation of, 281n., 316, 317 
special cases of, 282 
tables of, 284-295, 298-309 
π: ratio of circumference of a circle to its diameter,
189

value correct to 20 decimal places, 189n. 
value expressed as common fraction, correct to 
6 decimals, 189n. 

Pitman, E. J. G., 386n., 394 

Plan of action, of an acceptance inspection plan, 
definition of, 13 

for locating maximum of a function, sequential 
procedure for, 367-369 
suitability of, shown by OC curve, 15 
for variables inspection for fraction defective, 
outline of, 14 

procedures for, 16-21, 33-41 
Planning experiments, 245-372 (see Design of
experiments)

Plans for experiments (see Experiment designs)
Poisson distribution or series, relation of partial
sum to percentage points of χ², 109
mean, variance, third and fourth moments of, 
411 

sequential procedure for testing parameter of, 
426 

tables of, reference to, 109 
transformation of, to stabilize variance, derivation
of, 411, 412

(See also Square-root transformation)
Population, or universe, 12, 98, 227, 251, 341, 355, 
377 

(See also Lot)

Population mean (μ,ξ), 12, 161

(See also Confidence limits, for μ; Estimation,
of population mean; Mean, of population; 
Statistical tests, relating to population 
mean) 
Population proportion (P,p), 255, 256, 397, 410
(See also Binomial distribution, parameter of; 
Confidence belts; Confidence intervals; 
Confidence limits; Estimation; Proportion, 
population; Statistical tests) 

Population standard deviation (σ), 12, 161

(See also Confidence limits; Estimation;
Standard deviation, of population; Statistical
tests)

Population variance (σ²), 12, 161

(See also Confidence limits; Estimation; 
Statistical tests; Variance, population) 
Power function, of a statistical test procedure, 427 
(See also Operating-characteristic curves, 
Operating-characteristic surface) 

Precision (see Standard deviation; Variance) 
Preliminary sample, use of, in attributes inspection
when severity of any one test cannot be
controlled precisely, 336-338
in planning regression studies, when cost of
each measurement depends upon its
magnitude, 321-328

in planning sensitivity tests, 345, 346, 349-352

Preliminary trial of an examination, evaluating, 280

Princeton University, ix 
Probabilities, compounding, 167 
Probability, absolute, 251 

of acceptance, 15, 16, 28-33, 41, 42, 45, 50-52, 
232, 233, 261-263, 271, 278, 279, 282, 315- 
317, 358, 361, 427, 428 
(See also Operating characteristics; Operating-characteristic
curves; Operating-characteristic
surface)

effect upon, of changes in acceptance criterion,
21, 45

of changes in sample size, 21, 45 
of reduced inspection, 45 
of simultaneous use of several acceptance 
criteria, 50-52, 113, 114 
of tightened inspection, 45 
evaluation of (see L p ; L$\ Lp ; La', Lr\P(6,N) \ 
Computation, of OC curve) 
conditional, 251 

density or density function, 83, 155, 161, 193-
195, 217, 270, 281, 342, 392 
definition of, 193n. 

distributions, 10, 61, 62, 72, 73, 99, 109, 114, 
154, 155, 193-195, 197-199, 202, 214, 215, 
229, 230, 251, 270, 281, 342, 392 
element of, 155, 161 

(See also Probability, density) 
function of (see Probability, distributions) 
that gun barrel will be proofed, 332 
paper, arithmetic normal, 414 
binomial, 414n. 

that proximity fuse will operate, 342 
that sample means will be in opposite order to 
population means, 378-380 
Probability-wise interpolation, 316n., 318 
Probits, analysis of data expressed as, 342-344 
definition of, 343 

estimation of μ and σ from, 343, 350


Probits, experiments leading to data expressed 
as, 341, 342 

optimum design of, 345-346, 349-352 
relation to normal deviates, 343 
tables of, references to, 343n., 352 
variance of, formula for, 350 
Process, average (see Process average percent 
defective) 

new (see New method) 
old (see Old method) 

quality level of (see Process average percent 
defective) 

standard (see Standard method) 

Process average percent defective (PA), 42-45 
computation of confidence interval for, from 
attributes data, 333-336 
from variables data, 42-45 
as criterion for determining whether to maintain
or change volume of inspection, 42
example of, 44 

estimation of, from attributes data, 333-336 
from variables data, 42-45 
Process control, contrasted with acceptance 
inspection, 156, 157 
importance of x̄ and s in, 34
(See also Quality control) 

Producer’s risk (α), conventional value of (5%),
45n.

definition of, 16, 228, 229

maximum of, with variables inspection plans
for fraction defective, 16, 17
selection of, factors to consider in, 45-49
Product, new (see New method) 
old (see Old method) 

Product-moment coefficient of correlation, 413 
Production possibilities, analysis of, 46, 47 
Proof testing, statistical aspects of, 331-338 
Proofing gun barrels, 331-338 
Proportions or percentages, 15, 47n., 98, 99, 250, 
251, 255, 256, 259, 397, 410, 421, 424 
ASN curve for sequential one-sided test of, 
figures showing, 240, 242 
formula for, 427, 428 

confidence limits for (see Proportions, population)

estimation of (see Proportions, population) 
expected value of, 397, 410 
experiments for comparing two, analysis of 
data from, 249-258, 424, 425 
approximate single-sample method, based 
on binomial test, 249, 250, 255
based on χ², 253-255, 256-257
based on inverse-sine transformation, 
257, 258, 263, 264 

based on normal deviates, 253-257, 263 
“exact” single-sample method, 250-253 
by Fulcher-Zubin Item Analyzer, 257, 
257n., 258 

incorrect method for, 256 
nomograph for, Zubin’s, references to, 398, 
416 

by sequential analysis, 424, 425 
design of, 258-265, 424, 426-428 
Proportions or percentages, experiments for comparing
two, inverse-sine transformation of
data from, 257, 258, 263, 264, 397-408
number of observations needed, with sequential
sampling, 424, 427, 428
with single samples, 258-261, 264
formula for, 258, 259, 264 
nomogram for, 260 
operating characteristics of, 261 
operating-characteristic curve (OC curve) of, 
determination, by formula, 262 
from nomogram, 263 
figure showing, 262 

operating-characteristic surface (OC surface) 
of, 261, 262 

planning of, 258-265, 424 
sequential, 424, 425, 426-428 
experiments for estimating population or true
(see Proportions, population)
experiments for testing a single, analysis of
data from, 260n., 263, 264, 421, 422
ASN curves for double, multiple, and sequential
sampling of, 240, 242
design of, 259, 261-264, 424, 428

inverse-sine transformation of data from,
397-408

number of observations needed, with double
sampling, 237-244
figures showing, 240-243
with multiple sampling, 237-244
figures showing, 240-243 
with sequential sampling, 26, 237-244, 
419, 420, 427, 428 
figures showing, 240-243 
formula for, 427, 428 

with single sampling, 14, 15, 26-28, 259,
260n., 263, 264 
examples, 26 

formula for, 259, 260n., 263, 264 
nomogram for, use of, 260n. 

OC curve of, approximate determination 
from formula, 260n., 262 
from nomogram, 260n., 263 
exact determination of points on, using 
confidence-belt charts, 336n. 
using percentage points of incomplete 
beta function, 334, 336n. 
planning of, 259, 260n., 261-264, 426-428
sequential, 421, 422, 427, 428 
one-decision test, 358 
moments of, 410 

population, 15, 47n., 98, 99, 250, 251, 421
bias of estimation from φ(p) and φB(p), 403,
404 

tables showing, 400-402 
estimation of, confidence belts for, 332-335
confidence intervals for, based on attributes
inspection, 332-336
based on variables inspection, 42-45,
336n.

confidence limits for, evaluation from percentage
points of incomplete beta
function, 333, 334, 337
formula for, 335, 336 


Proportions or percentages, population, estimation
of, experiments for, analysis of
data from, 332-336 

experiments for, design of, 238, 333-337 
inverse-sine transformation of data 
from, 336 

number of observations needed, 333-337 
sequential, 238n. 

included between statistical tolerance 
limits, 47n., 98-99 
standard deviation of, 255, 397, 410 
tests of significance regarding (see Proportions, 
experiments for. . . .) 

transformation of, to stabilize variance, 257, 
258, 263, 264, 397-408 
(see Inverse-sine transformation)
variance of, 397, 413

Protection afforded by sampling inspection, 46,
47, 156-158 

Przyborowski, J., 46n., 93, 255n., 265
Psychological considerations, 278 

Q 

Quality, acceptable, 61, 228, 229, 269, 278, 279, 
423 

(See also Acceptable-quality level)
of accepted lot or product, 16, 41, 42, 46, 156-158

attainable, 46, 47 
bad (see Quality, unacceptable) 
choice of standards of, 45-49 
control of (see Quality control) 

(See also Control charts; Control limits) 
cost of, OC curve a determining factor in, 46-47 
desired, 45, 46 

evaluation of (see Quality, of lot; Quality, 
produced by process) 
feasible, 45-47 

good (see Quality, acceptable) 
high, 42, 43 
improvement of, 46 
information about, 42 

provided by process average percent defective,
43

by sampling inspection, 156-158 
by statistical tolerance limits, 47, 97-100 
of an item, 7 

of lot, determined by average quality of items 
of lot, 8, 27-28, 61-63 
by dispersion of quality of items of lot, 8,
27, 28, 229, 269, 270, 278, 279, 280, 283
by fraction defective, 7, 8

combinations of μ and σ that correspond
to same fraction defective, 11
by fraction of averages of samples of n
that exceed specified limit, 48
by fraction of ranges Rk in samples of k
that exceed specified limit, 228, 229
formulation in terms of σ, 229-231
by mean quality (μ) in conjunction with
dispersion of quality (σ), 11
by mean quality (μ) of items of lot, 8, 27,
28, 61-63
Quality, of lot, determined by several quality
characteristics, 50-52, 113, 114
by standard deviation (σ) of qualities of
items of lot, 8, 27n., 28n., 229, 269,
270, 278, 279, 280, 283
by upper and lower limits on a single 
quality characteristic, 52-56, 63, 64 
by variability of quality among items of 
lot, 8, 27, 28, 229, 269, 270, 278-280, 283 
evaluation of (see Confidence intervals;
Confidence limits; Estimation; Tolerance
limits)

judgment of, from a sample or samples (see
Acceptance criterion; Acceptance inspection)

submitted, 16, 41, 42, 46, 156-158 
low, 42, 43 
maintenance of, 43n. 
measures of (see Quality, of lot) 
of outgoing product, 16, 41, 42, 46, 156-158 
poor, 42, 43 

produced by process, evaluation of (see Confidence
intervals; Confidence limits; Estimation;
Process average percent defective;
Tolerance limits)

information about, provided by process 
average percent defective, 43 
by sampling inspection, 156-158 
by statistical tolerance limits, 47, 97-100 
of a sample, determined by sample average, 48 
by sample range, 228, 229 
submitted, 16, 41, 42, 46, 156-158 
unacceptable, 45n., 47-49, 229 
Quality control, applied to air testing of bombsights,
113-184

contrasted with acceptance inspection, 117, 
156-158 

feature, in procedure for going on reduced 
inspection, 43n. 
functions of, 117 
importance of x̄ and s in, 34
multivariate, 113-184 

use of a single “score” or “index” in, 113, 
114 

based on generalized Student T, 114-116 
advantages of, 115 
objectives of, 156, 157 

(See also Control charts; Control limits) 

R 

Rk: rejection number for the kth sample, 237
(See also Rejection number) 

Rk: range of a sample of k items, 229 
(See also Range) 

r: observed product-moment coefficient of correlation,
413

z-transformation of, to stabilize variance,
derivation, 413
Radio proximity fuze, 341 

analysis of test data, method of, 342-344 
design of tests of, optimum, 345-352 
Random selection, of bombsights, reasons for, 158 


Random variation, components of, 269, 297, 310,
311

estimation of, and tests of hypotheses regarding,
311-314

Randomization, methods of, 176, 177 
as precautionary measure, 118 

to prevent bias from time trends, 371 
restricted, 177 

role of, in design of experiments, 141, 142, 176, 
177 

Range, calculation of, 227 
definition of, 227 

distribution of, in samples from a normal 
population, 234 
cumulative, 229 
figure showing, 230 
tables of, reference to, 229, 234 
mean of, table, reference to, 227n. 
standard deviation of, 227n. 
as estimator of σ, 227

efficiency of, in small samples from a normal 
population, 227n. 

as measure of dispersion of sample, 227 
reasons for use, 227 

in samples of k, primary characterization of lot 
quality, 227, 228 

acceptance inspection when, 227-234 
Range of normal variation, 97 
(See also Tolerance limits)

Rational sentencing of lots, 156, 157 
Rectangular distribution, 195, 217, 218 
as representation of grouping or rounding
errors, 195 

for tabular entries, 215-218 
semi-invariants of, 195, 196n. 
standard deviation of, 218 

use of, in checking sums and tables, example,
218 

variance of, 195, 218 
Reduced inspection, 41-45 

criterion for changing to, when lot quality is 
measured by fraction defective, 42 
procedure for judging whether criterion is 
met, using variables data, 42-45 
when upper and lower limits are imposed 
on a single quality characteristic, 56 
effect of, on OC curve, 45 

methods of achieving, with variables inspection 
for fraction defective, 45 
purpose of, 41, 42 

Redundant requirements in specifications, 50n. 
Rees, Mina, x 

Regression analysis, when cost of each measurement
depends upon its magnitude, 319-328
when dependent variable is expressed in probits, 
342-344, 349-352 

Rejection criterion, for sequential likelihood ratio 
tests, 420, 421 

for sequential variables inspection for fraction 
defective, 83, 85 

for single-sample tests (see Acceptance criterion) 
(See also Rejection line; Rejection number) 
Rejection line, 232, 357-360, 422, 423, 425 
adjusting, to allow for acceptance line, 358-360 
Rejection line, figure showing, 357
for number of “successes” in binomial test, 422
for number of “success-failure” pairs, in double-dichotomy
test, 425
for one-decision binomial test, 358
for sum of observations, in test for μ, σ known,
423

for sum of squared observations, in test for σ, μ
unknown, 232 

Rejection number ( Rk , Si), 237, 239, 244, 358, 
422, 425 

definition of (Rk), 237 
in double sampling, 237 
examples, 239, 244 
in multiple sampling, 237 
examples, 239, 244 

relation to acceptance number (Ak) characterizes
type of sampling plan, 237
in sequential sampling (Rk), 237
in sequential tests (Si), 422, 425
in sequential tests that provide for rejection
only, 358

in single sampling, 237, 260n., 263, 264
Rejection region, boundaries of, for sequential
tests (see Rejection line)
for single-sample tests (see Acceptance region,
boundaries of)

Relationship, determining, when cost of each measurement
depends upon its magnitude, 319-328
Replication, dual role of, in experimentation, 159 
ρ: product-moment coefficient of correlation in
normal bivariate population, 413
ρ: ratio of population variances (σ1²/σ0²), 271
ρ(α,β,n1), definition of, 271

large-sample formula for, 278n.
tables of, 272, 273, 276, 277
use of, examples of, 271, 274, 275, 278
ρ'(α,β,n1), definition of, 279
relation to ρ(α,β,n1), 279
use of, example of, 280 
Rider, Paul R., 386n., 394 
Rietz, Henry Lewis, 196n., 197n., 218n., 223 
Risks, in use of sampling inspection, 156, 157 

(See also Consumer’s risk; Operating-characteristic
curves; Producer’s risk)

Robbins, Herbert, 109 
Rodermund, Edythe B., x 
Romig, Harry G., 27n., 93 

Rounded numbers, impossibility of avoiding use 
of, 188, 189 

measurements as, 187-189 
scale readings as, 188, 189 
tabular entries as, 191, 215-218, 221 
Rounding, 187-223 
accepted procedure for, 189, 190 
(See also Rounding Rule) 

Cayley’s procedure for, 191 
effect of, on descriptive statistics, 187, 188 
on distribution of error of an interpolated
value, 221
of F, 214, 215
of s², 198-200, 203-214
of Student’s t, 200-203, 214, 215
of x̄, 196, 198, 212, 213
Rounding, effect of, on moments of probability
distributions, 193-196 
representation as effect of random 
"errors," 195 

on standard procedures for statistical inference,
191-215

impossibility of avoiding, 188, 189 

reasons for, 187-189 

repeated, consequences of, 190, 191 

(See also Rounding interval; Rounding lattice;
Rounding rule; Sheppard’s corrections)

Rounding interval, recommended size of, 192, 214 
reasons for, 214 

Rounding lattice, effect of position of, on distribution
of s², 206-213
on distribution of x̄, 212, 213
random location of, 194 

extends validity of Sheppard’s corrections, 
196 

procedure for, when only non-negative values 
occur in population, 196 
Rounding rule, 189, 190 
Cayley’s form of, 191 
effect of repeated applications of, 190, 191 
elementary properties of, 190, 191 
importance of adhering to, 190 
Roy, S. N., 184
Royalties, vii 

S 

S: sensitivity of a fuze, 342

s: sample estimate of population standard deviation
(σ), based on sum of squared deviations
from sample mean, 12, 13, 162, 192
adjustment of, 58 

advantages of, relative to mean deviation, 162, 
163 

relative to sample range, 227n. 
association of values of, with values of x̄, when
data are grouped or rounded, 187, 188 
bias of, 58 

adjustment factor, formula for, 58 
tables of, references to, 58n. 
when computed from grouped or rounded 
data (see s²)

combining values of, to estimate variability of
process, 42-43

computation of, 12, 13, 33, 34, 42, 43, 162, 192 
N — 1 versus N in, 13, 58 
naive procedure for, criticism of, 33 
using modern computing machines, 33, 34
definition of, 12, 13, 162, 192 
N — 1 versus N in, 13, 58 
degrees of freedom of, 12, 13, 43 
(See also s², degrees of freedom of)
division by, in computing, 12, 13, 43 
as denominator of Student’s t, 72, 200 

Sheppard’s corrections not to be used for, 203 
design of experiments for testing, 269-296 
distribution of, approximately normal for large
N, 58
s: sample estimate of population standard deviation
(σ), distribution of, mean of, formula
for, 58

tables of, references to, 58n. 
standard deviation of, formula for, 58 
tables of, references to, 58n. 

(See also s², distribution of)
effect of grouping or rounding of data on (see s²)
efficiency of, 162, 227 

expected value of, in normal samples, formula 
for, 58 

tables of, references to, 58n. 
formula for, 12, 13, 43, 162, 192 
grouping of data, effect on, 187, 188 (See also s²)
homogeneity of two or more values of, 385-389 
importance of distinguishing from <r, 12 
importance of, in quality control, 34, 42-44
largest of several values of, testing of, 383-394 
logarithmic transformation of, to stabilize 
variance, 412, 413 

mean of, in normal samples, formula for, 58 
tables of, references to, 58n. 
multiple of, for setting tolerance limits (see K)
used in variables inspection for fraction
defective (see k)

N − 1 versus N, in definition and computation
of, 13, 58

standard deviation of, in normal samples, 
formula for, 58 
tables of, references to, 58n. 
termed “sample standard deviation” for 
brevity, 12, 13 

tests of significance regarding (see s²)
transformation of, to stabilize variance, 412, 
413 

use of, in acceptance inspection for σ, 231, 232,
270, 271, 278-280 

in acceptance inspection when ranges in 
samples of k determine lot quality, 227- 
234 

in computing control limits for means, 147, 
148 

in estimating process average percent defective,
41-44

opposition to, rebuttal of, 33, 34 
in quality control, 34, 42-44, 151, 152 
in setting tolerance limits, 47 
in variables inspection for fraction defective, 
9-14, 33, 34, 41-44, 48, 53, 56, 58-64, 71- 
73, 83-85 

s: sample estimate of population standard deviation
(σ), based on cumulative distribution
(probit data), 343, 350 
variance of, 351 

with optimum experimental design, 352 
s: slope of parallel lines in graphical sequential
analysis, 232, 233, 357, 358, 426, 427
formula for, binomial test, one-decision, 358
binomial test, one-sided, 422
double-dichotomy test, 425
mean test, σ known, 423
standard deviation test, μ unknown, 233
use in constructing ASN curve, 233
five-point, 427


s: slope of parallel lines, in graphical sequential
analysis, use in constructing OC curve,
233

five-point, 428 

sE: observed number of “successes” in N trials
with experimental method, 262
sS: observed number of “successes” in N trials
with standard method, 262
s 2 : sample estimate of population variance (<r 2 ), 
12-13, 43, 162, 198 

bias of, when computed from grouped or 
rounded data, 198-200, 203-214 
computation of, 12, 13, 33, 34, 42, 43, 162, 198 
control limits for, 151 

correlation of, with x̄, when computed from
grouped or rounded data, 187, 201 
definition of, 12, 13, 43, 162, 198 
degrees of freedom of, 12, 13, 43, 199, 203, 270, 
312, 387, 388 

design of experiments for determining and 
testing, 269-296 

distribution of, mean of, in sample from arbitrary
grouped or rounded populations, 199
in large samples from arbitrary population, 
199 

effect of grouping or rounding on, 198-200 
in samples from a normal population, 199, 
270 

effect of grouping or rounding on, in large 
samples, 198-200 
in small samples, 203-214 
variance of, in samples from arbitrary,
grouped, and normal populations, 199 
effect of grouping or rounding of data on, 187, 
188, 198-200, 203-214 
efficiency of, 162 

expected value of, in samples from arbitrary, 
grouped, and normal populations, 199 
formula for, 12, 13, 43, 162, 198 
grouping of data, effect on, 187, 188, 198-200, 
203-214 

homogeneity of two or more values of, testing 
of, 385-389 

importance of distinguishing from σ², 12
independent of x̄ in samples from normal
population, 200 

effect of grouping or rounding on, 187, 188, 
201 

largest of several values of, testing, 383-394 
logarithmic transformation of, to stabilize 
variance, 412, 413 

mean of, in samples from arbitrary, grouped, 
and normal populations, 199 
moments of, in samples from a normal population,
412

tests of significance regarding, sequential, 232, 
233, 426 

single sample, 231, 232, 269-296, 383-394 
transformation of, to stabilize variance, 412,413 
uses of, special (see s) 

variance of, in samples from arbitrary population,
199

when data are grouped or rounded, 199 
in samples from normal population, 199 
s²: sample estimate of population variance (σ²),
variance of, in samples from normal population,
when data are grouped or rounded, 211
yardsticks for, 152 
Salzer, H. E., 100, 416 
Sample average (see Sample mean)
as basis for decisions, 156-158

(See also Consumer’s risk; Operating characteristics;
Operating-characteristic curves;
Producer’s risk) 
mean (see Sample mean) 
median (see Median) 
random, 158-160 
representative, 160 
selection of, 158-160 
size of (see Sample size) 

Sample mean (x̄), 12, 43, 100, 102

advantages of, relative to sample median, in
normal samples, 161, 162
bias of, when data are grouped or rounded, 187,
188, 197, 212

combining, from several samples, in estimating
process average percent defective, 41-44
computation of, 12, 13, 100, 102
control limits for, 147-150

correlation of, with s or s² when data are
grouped or rounded, 187, 188, 201
definition of, 12, 43, 100, 102
as descriptive statistic, effect of grouping or
rounding on, 187-188

design of experiments for testing, 17, 27n., 28n.,
60-66, 81, 82, 377-382, 423 
distribution of, for arbitrary population, 197, 
198 

effect of grouping or rounding on, 196-198, 
203 

for normal population, 162 
effect of grouping or rounding on, 187, 188, 196, 
198, 212, 213 

efficiency of, in normal samples, 162 

when data are grouped or rounded, 197 
as estimator of population mean (μ), arbitrary
population, 197, 198

when data are grouped or rounded, 196-
198, 212, 213
normal population, 162
(See also Confidence limits, for μ)
expected value of, 197, 198, 212
importance of distinguishing from population
mean (μ), 12
mean of, 197, 198, 212 

moments of, in samples from arbitrary, grouped, 
and normal populations, 197, 198 
use in quality control, 34, 42-44, 147-150 
in variables inspection for fraction defective, 
9-14, 33, 34, 41-44, 48, 53, 56, 58-64, 
71-73, 83-85 
variance of, 162, 197, 198 
yardsticks for, 150 

Sample mean deviation (see Mean deviation)
Sample median (see Median)

Sample size, adjustment of, by modifying OC 
curve, 19 


Sample size, average, 233, 238, 420, 427 
(See also ASN curve)
changes of, effect on OC curve, 21, 45
for each of two samples (see N, second listing)
for single sample (see N, first listing)

(See also Amount of inspection; Computation
of number of observations needed;
Formula, for number of observations 
needed; Na, Nm, i\T», Ni) 

Sample standard deviation, 12, 13, 187n., 188n.
(See also s)

Samples, combined, 42, 43
Sampling, biased, 160
curtailed, 14, 238, 361 

random, role of, in experimentation, 158-160
representative, likelihood of bias of, 160
Sampling inspection, aims of, 156-158
control features of, 42, 43, 157
effectiveness of, 156-158

(See also Operating characteristics; Operating-characteristic
curves; Operating-characteristic
surface)
objectives of, 156-158
risks involved in, 228, 229

(See also Consumer’s risk; Producer’s risk)
steps in, 13, 14

value of, dependent upon quality of submitted 
product, 42 

(See also Acceptance inspection; Inspection) 
Sampling-inspection plans, attributes (see Attributes,
inspection)
changes in, 41-45, 142-145 
classification of, 237 

comparison of, 26-28, 85, 237-244, 419, 420 
double, 237-244 
multiple, 237-244 
multivariate, 111-184 

sequential, 83-85, 232, 233, 358-362, 419-428 
single sample, 7-82, 111-184, 231, 232, 237, 
249-264, 269-296, 331-338, 377-381, 385- 
393 

(See also Acceptance inspection; Inspection;
Statistical tests) 

Savage, Leonard J., ix, xii, 238n., 244, 354, 364 

Scale readings, 188, 189 

Scheffé, Henry, 109

Schumpeter, Joseph A., 145n., 183 

Scores, examination, dispersion of, 280 

use of, in acceptance inspection and quality
control, when several quality characteristics
are involved, 50-52, 113-116
Sealy, E. H., 156, 157, 184 
quotation from, 156, 157 
Semi-invariants, of rectangular distribution, 195, 
196n. 

Sheppard’s corrections for, 196n., 222 
Sensitivity tests, 341, 342 

analysis of data from, 342-345 
optimum design of, 345, 346, 349-352 
use of preliminary data in, 346-349 
Sequential analysis, 26, 83-85, 232-234, 237-244, 
357-362, 367-372, 417-429 
advantages of, 26, 367-371, 419, 420 
Sequential analysis, compared with non-sequential
analysis, 26, 85, 237-244, 367-372, 419, 420
conditions for superiority of, over non-sequential
analysis, 369, 371, 419, 420
for superiority of non-sequential analysis, 
371, 420 

definition of, 237, 367, 419 
disadvantages of, 371 
elements of, 417-429 

Sequential design, of experiment, for locating 
maximum of a function, 367-372 
when failure rate of equipment is high, 353- 
362 

Sequential estimation, of parameter of binomial 
distribution, reference to, 238n. 
Sequential-sampling-inspection plans, 83-85, 232, 
233, 358-362, 421-426 
(See also Statistical tests, sequential) 
acceptance line, in graphical procedure for,
232, 356, 357, 359, 360, 422, 423, 425 
(See also Acceptance line) 
acceptance number in, 237, 422, 425 
amount of inspection in (see Amount of inspection,
with sequential sampling)
operating-characteristic curves of (see Operating-characteristic
curves)
that provides for rejection only, 357-362 
rejection line, in graphical procedure, 232, 356, 
357, 359, 360, 422, 423, 425 
(See also Rejection line) 
rejection number in, 237, 422, 425 
Severity of test, cannot be controlled precisely, 
inspection for fraction defective when, 
331-338 

Sheppard, W. F., 188, 193, 196, 197, 199, 203, 223
Sheppard’s corrections, 193-196 
bivariate, references to, 193n. 
conditions for strict validity of, 193 
consequences when not fulfilled, 196 
implications of, 194 

conditions for validity “on the average,” 194- 
196 

when grouping lattice is located at random, 
194-196 

for J-shaped distributions, 194 
references to, 194n., 222 
multivariate, references to, 193n., 223 
not to be used, in computing confidence limits, 
199, 200, 203 

in evaluating tests of significance, 199, 200, 
203 

Shewhart, Walter A., 47n., 49n., 58n., 93, 97n., 
110, 221n., 223 

Short cuts, in analysis of 2 X 2 table, 253-255 
σ: standard deviation of lot or population (see
Standard deviation, of lot; of population)
σ(N): standard deviation of number of items
inspected, 238

figures showing, for double-, multiple-, and 
sequential-sampling plans, 241, 243 
σ₀: standard deviation corresponding to standard
method, process, or product, 269
σ₁: standard deviation corresponding to new
method, process, or product, 269


σ₁: standard deviation defining good quality,
when lot quality is measured by fraction of
sample ranges that exceed specified limit,
229-231

σ₁: standard deviation of the first of two methods,
processes, or products, 280
σ₂: standard deviation defining bad quality,
when lot quality is measured by fraction of
sample ranges that exceed specified limit,
229-231

σ₂: standard deviation of the second of two
methods, processes, or products, 280
Significance, statistical tests of (see Statistical 
tests) 

Significance levels, of Bartlett’s test, approximation
to, 388

tables of, reference to, 388n., 394 
of Xe, approximation to, 255 
tables of, reference to, 255 
of χ², Cornish-Fisher type expansion for, reference
to, 109

for one degree of freedom, equivalent to K a a , 
258n. 

relation of, to partial sum of Poisson series, 
109 

to significance levels of F-distribution, 316 
tables of, references to, 315n. 
of Cochran’s test, tables of, 390, 391 
of F, Paulson’s approximation to, 393, 394
for n₂ = ∞, equivalent to significance levels
of χ², 316
relation of, to percentage points of Fisher’s z,
317n.

of incomplete beta function, 392, 393 
tables of, comments on, 151n., 152n., 317n. 
references to, 151n., 184, 317n., 318
Hotelling’s T (see Significance levels, of T) 
meaning of, 270, 271, 281 
of normal distribution, relation of, to probits, 
343 

short table of, 18 

tables of, references to, 10n., 18, 343n. 
need for care in using, 10n.
of range, in samples of k from normal distribution,
curves showing, 230
table of, reference to, 229n., 234 
of T, asymptotic expansion for, 155 

relation of, to significance levels of χ², 155
of F, 155 

of t, asymptotic expansion for, 148, 149n., 184 
tables of, references to, 148n. 
of z, relation to significance levels of F, 317n.
Significant differences (see Statistical tests) 
Significant figures, 189 
definition of, 189n. 

recording excessive number of, 191, 192 
too few, 192 

Simaika, J. B., 316n., 318 
Simon, Herbert A., 381, 382 
Simultaneity of events, acceptance inspection for, 
227-234 

formulation in terms of σ, 229-231
measured by fraction of sample ranges that are 
less than specified limit, 227, 228 
Single-sampling-inspection plans, 7-82, 111-184,
231, 232, 237, 249-264, 269-296, 331-338, 
377-381, 385-393 

(See also Statistical tests, single-sample)
acceptance number in, 237, 260n., 263, 264 
amount of inspection (see Amount of inspection, 
with single sample) 
curtailment of, 14, 238 

operating characteristics of (see Operating 
characteristics; Operating-characteristic 
curves; Operating-characteristic surface) 
rejection number in, 237, 260n., 263, 264
Skewness, of distribution of x, when data are 
grouped, 198 

measures of (√β₁, γ₁), 197, 198
Slope, of linear regression function (β), 322

determination of, when cost of each measurement
depends upon its magnitude, 322-328
when dependent variable is expressed in
probits, 343, 350
references to, 343n., 352
relation of, to σ, 343

two-point estimate of, formula for, life' 
Small samples, effect of grouping or rounding on
statistical analyses of, 203-215
efficiency of, as basis for decisions, 156, 157 
Smith, B. Babington, 120n., 184 
Snedecor, George W., 405, 413 
Solomon, Herbert, ix, 384 
Specifications, inconsistent, 50n., 51n. 

redundant, 50n., 51n.

Sponsorship, vii 

Square-root transformation, 411, 414 
Stabilizing variance, by transformation of data, 
408-414 

Standard, new method, process, or product compared
with, with respect to fraction defective,
7-93, 27n., 28n., 231-234, 237-244, 270, 271,
274, 275, 278-280, 423 

with respect to level of performance, 17, 27n., 
28n., 59-63, 81, 82, 423 

with respect to variability of performance, 
227-234, 270, 271, 274, 275, 278-280 
(See also Old method) 

Standard deviation, of binomial distribution, 255, 
397 

of correlation coefficient (see Variance) 
of lot (σ), as measure of lot quality, 27n., 28n.,
229-231, 269-296 

(See also Standard deviation, of population) 
of mean (see Variance) 
of mean deviation (see Variance) 
of median (see Variance) 

of normal distribution (σ), 9, 12, 58, 97, 161,
229, 342, 378, 423, 426
bias of s as estimator of, 58

adjustment factor, formula for, 58
tables of, references to, 58n.
as parameter of, 12, 161 
as reciprocal of slope of regression line, when 
dependent variable is expressed in probits, 
343 

of number of items inspected (σ(N)), 238


Standard deviation, of number of items inspected
(σ(N)), figures showing, for double-, multiple-,
and sequential-sampling plans, 241, 243
of Poisson distribution (see Variance) 
pooled estimate of, 43 

of population (σ), bias of s as estimator of, in
normal samples, 58 
confidence limits for (see Variance) 
estimation of, from mean deviation, 162, 163, 
169 

from probit data, 342-352 
from range, 227, 227n. 
from s, in normal samples, 58 
from sensitivity tests, 342-352 
(See also Variance)

importance of distinguishing from s, 12
as parameter of normal distribution, 12, 161
tests of significance relating to, sequential,
232, 233, 426

single-sample, 231, 232, 269-296, 385-394 
of proportion or percentage, 255, 397 
of r (see Variance) 
of range, 227n. 

of rectangular distribution, 218 

of s, in normal samples, formula for, 58 

of sample, 12, 13, 187n., 188n. 

Standard error (see Standard deviation) 

Standard method, process, or product, compared 
with a new (see Old method) 

Stanford University, ix, xi, xii, 96, 384 
Statistical hypothesis, alternative, 419, 420 
null, 420 

under test, 419, 420 

Statistical Research Group, iii, ix-xi, 6, 8n., 20n., 
26n., 34n., 42n., 52n., 83n., 85n., 93, 101n.,
108n., 109n., 143n., 184, 188, 231n., 234,
237n., 244, 257n., 260n., 335n., 358n., 361n., 
362, 400, 405n., 415, 419n., 420n., 428, 429 
senior scientific staff of, list of names, ix, x 
Statistical tests, analysis of variance, 312-314, 
385, 386 

for components of variance, 312-314 
for homogeneity of means, 385, 386
Bartlett’s, for homogeneity of variance estimates,
183, 386-388

based on sign of difference between sample 
means, 377-382 

conditions rendering optimal, 381, 382 
bias of, if Sheppard’s corrections are used, 199, 
200, 203 

binomial distribution, based on, 249, 250, 255 
for comparing parameters of two, 249-258, 
424, 425 

for testing parameter of a single, 260n., 263, 
264, 421, 422 

with provision for rejection only, 358 
Xa-test, 254—257 

χ²-test, for deciding from s₁², whether σ₁² > σ₀²,
270-278

for deciding from s₁², whether σ₁² < σ₀²,
278-280

for deciding from “sum of squares” whether
σ² > σ₁², 231, 232

for homogeneity of proportions, 250, 257 
Statistical tests, χ²-test, for homogeneity of
proportions, of 2 X 2 table, 254, 255
of variance estimates, 386-388 
Cochran's, for significance of largest variance 
estimate, 383-394 

for comparing two binomial distributions, 249- 
258, 263, 264, 424, 425

two means, 61, 62 

two percentages or proportions, 249-258,
263, 264, 424, 425

two standard-deviation or variance estimates, 
280-297 

for components of variance, 312-314 
curtailed, 14, 238, 361 

for determining whether new method, process, 
or product exceeds a standard, with respect 
to fraction defective, 7-93, 27n., 28n., 
227-234, 237-244, 270, 271, 274, 275, 278- 
280, 421-423 

with respect to level of performance, 17,
27n., 28n., 59-63, 81, 82 
with respect to variability of performance, 
227-234, 270, 271, 274, 275, 278-280
for determining whether a particular one of two 
methods, processes, or products exceeds the 
other, with respect to fraction defective, 
249-258, 263, 264, 424, 425 
with respect to level of performance, 59-63, 
380 

with respect to variability of performance, 
280, 282, 283, 296
for difference of means, 59-63, 380 

of percentages or proportions, 249-258, 263, 

264, 424, 425 

for double dichotomy, 249-258, 424, 425 
double-sampling, for fraction defective, 237-244 
effect of grouping or rounding on, with large 
samples, 191-203 
with small samples, 203-215 
errors of the first and second kinds associated 
with, 271, 281, 282 

F-test, in analysis of variance, 311-314
for components of variance, 311-314
for homogeneity of means, 385-386
of ratio of standard deviation or variance
estimates, 280-296
for four-fold table, 249-258, 424, 425
for fraction defective, based on enumeration 
data, 237-244, 249-258, 260n., 263, 264, 
421, 422, 424, 425 

based on measurements, 7-93, 27n., 28n., 
227-234, 270, 271, 274, 275, 278-280, 423 
general one-parameter, sequential, 426-428 
single sample, 61, 62 

generalized Student’s ratio (see Hotelling’s T)
for generalized variance, reference to, 166 
graphical, 34-37, 52-56, 66-71, 232, 233, 260, 
356-359, 398, 414n., 416, 422, 423, 425 
grouping, effect of, on, with large samples, 
191-203 

with small samples, 203-215
for homogeneity, of means, 61, 62, 385, 386 
of percentages or proportions, 249-258, 263, 
264, 424, 425 


Statistical tests, for homogeneity, of 2X2 
table, 249-258 

of variance estimates, 385-389 
Hotelling’s T, 114-116, 152-156 
indifference value in, 282 

for mean, comparison of two or more, 59-63, 
380, 385, 386 

test of a single, 61-63, 71-73, 83-85, 197, 198, 
200-203, 212, 213, 423, 426 
mechanical (see Fulcher-Zubin Item Analyzer) 
multiple-sampling, for fraction defective, 237- 
244 

multivariate, 50-52, 114-116, 147-156 
number of observations required (see N; 37; 
N a ; N m ; N,; Ni) 

(See also Amount of inspection; ASN curve; 
Computation of; Formulas) 
omnibus, 386 

one-decision sequential binomial test, 358 
one-sided or one-tail, 52, 57, 150, 152, 156, 426 
operating characteristics of (see Operating 
characteristics; Operating-characteristic 
curves; Operating-characteristic surface) 
for order of population means, 377-382 
planning (see Design of experiments) 
for proportions or percentages, comparison of 
two, 249-258, 263, 264, 424, 425 
test of a single, 237-244, 260n., 263, 264, 421, 
422

with provision for rate of failure of equipment, 
355-362 

for range in samples of k, formulation in terms
of sample “sum of squares,” 229-231 
for ratio of two standard-deviation or variance 
estimates, 280-296 
risks associated with, 271, 281, 282 

(See also Operating-characteristic curves) 
sequential, 83-85, 232, 233, 358-362, 421-426
for sign of difference between population 
means, 377-382 

significance levels of (see Significance levels) 
for standard deviations or variances, comparison
of two or more, 231-233, 269-296, 385-394,
426

T-test, Hotelling’s, 113-116, 147-156 
t, non-central, 71-73 

Student’s, 72, 147-149, 200-203, 214, 215 
relation of, to Student’s z-test, 200 
two-sided or two-tail, 52-57, 150, 152, 156, 426 
transformation of data in, 257-259, 261-264, 
395-416 

truncated, 14, 238, 361 
for variance components, 312-314 
for variances (see Statistical tests, for standard 
deviations) 

Statistics, sample, contrasted with population 
parameters, 161 
Steffensen, J. F., 221n., 223 
Stigler, George J., ix

Student (William Sealy Gosset), 187, 200, 201, 
203, 223 

quotation from, 187, 188 
Student’s t, definition of, 72, 200 
Student’s t, distribution of, effect of grouping or
rounding on, with large samples, 200-203 
with small samples, 187, 188, 214, 215 
percentage points of, asymptotic expansions for, 
148, 149n., 184 
tables of, references to, 148n.
relation of, to Student's z, 200
Sheppard’s corrections not to be used in computing,
203

Student’s z, definition of, 187n. 

effect of grouping on, 187, 188 (See also
Student’s t)

relation of, to Student’s t, 200 
Systematic variation, 310, 311 

T 

T: angular difference between two binomial
probabilities Pt and Ps, 262, 264
T: generic symbol for an estimator of a parameter
θ, 408

T: Hotelling’s generalized Student ratio, definition
of, 142, 154, 155
distribution of, 155

asymptotic expansion for percentage points
of, 155

free from nuisance parameters, 115
relation to χ²-distribution, 155
to F-distribution, 155 
optimum properties of, 147 

as “score," combining results of two or more 
tests, 113-116 

T : Zubin’s symbol for a transformed proportion, 
405, 415 

T-statistics, advantages of, 115, 164 

applied to air testing of bombsights, 122-142, 
164-168 

control charts for, figures showing, 132, 134, 136 
control limits for, computation of, 152-155 
definitions of, and relationships, 131, 133, 153, 
154 

distributions of, 152-155, 164 
uses of, 115 

yardsticks for, computation of, 156 
t: angular difference between two observed proportions
ptc and ps, 264

t : ratio of a normally distributed variable to an 
independent estimate of its standard 
deviation, 72, 73 

(See also Student's t; Non-central t)
t₀: Johnson and Welch notation for a given level
of non-central t, 73 
Tables, checking of, x 

list of principal reference, xiii 
Tabular entries or values, effect of further rounding
of, 191

Fisher-Wishart recommendation regarding, 221 
statistical properties of, 215-218, 221 
Tchebycheff inequality, 49, 221 
θ: angle corresponding to an observed proportion,
257, 258, 398, 400

θ: generic symbol for a parameter of a probability
distribution, 408, 426

θ₀: value of θ when null hypothesis is true, 426

θ₁: value of θ when alternative hypothesis is true,
426

Thompson, Catherine M., 110, 151n., 152n., 184,
315-318, 333, 335n., 336n., 338, 388n., 393,
394 

Tightened inspection, 41-45 

criterion for changing to, when lot quality is 
measured by fraction defective, 42 
procedure for judging whether criterion is 
met, using variables data, 42-45 
when upper and lower limits are imposed 
on a single quality characteristic, 56 
effect of, on OC curve, 45

methods of achieving, with variables inspection 
for fraction defective, 45 
purpose of, 41, 42 

Time to rupture, for test specimens, 321, 365
determination of, when cost of each measurement
depends on its magnitude, 322-328
determination of composition yielding maximum,
365-372

Time trends, effect of, on statistical analyses, 371,
372 

Tippett, L. H. C., 120n., 184, 227n., 234 

Tolerance factor (K), for normal distribution, 
97-108 

definition of, 97, 98 
formulas for, 101, 108 
table of values of, 102-107
uses of, examples of, 100, 101 
(See also Tolerance limits) 

Tolerance limits, statistical, 47, 97-108 
conditions for satisfactory use of, 97
contrasted with confidence limits for population
mean, 99, 100
definition of, one-sided, 47, 98, 99 
two-sided, 97, 98 
distribution-free, 99, 109 
for non-normal distributions, 99, 109 
for normal distributions, 47, 97-99, 100-108 
one-sided, 47, 98, 99 

computation of, example, 47 
two-sided, 47n., 97-99, 100-108 
examples of, 100, 101 
factors for, table of, 102-107
formulas for, 101, 108 

Transformation, angular, 257-259, 261-264, 336,
395-416 

general, to stabilize variance, 408-410
inverse hyperbolic tangent, 413
inverse sine, 257-259, 261-264, 336, 395-416 
logarithmic, of F, 317n. 
of r, 413 
of s or s², 413
probit, 343, 352
square root, 411, 414
z-, of correlation coefficient (r), 413
of variance ratio (F), 317n. 

Transformed correlation (z), 413 

Transformed percentage or proportion (φ, T, θ),
257, 258, 398, 400, 405, 415 
(See also Inverse-sine transformation) 

Trends, effect of, on statistical analyses, 371, 372 

Tukey, John W., 109, 414n.
2 X 2 table (see Double dichotomy)

Two-sided criterion, in variables inspection for 
fraction defective, 52-57, 63, 64 
Two-sided or two-tail tests, 52-57, 150, 152, 156, 
426 

Two-sided tolerance limits, 47n., 97-99, 100-108 
U 

U: normal deviate exceeded with probability p, 
72 

U : upper limit of acceptable ranges in sample of k, 
227, 228 

U : upper limit of a measured quality characteristic,
above which an item is defective, 9-12
(See also L) 
selection of, 46, 47 
as tolerance limit, 47 
U-shaped distribution, 194, 222
United States Government, vii 

(See also Army; Bureau of the Census; Department
of Agriculture; National Bureau
of Standards; National Defense Research
Committee; Navy; Office of Scientific
Research and Development)

Universe (see Lot; Population) 

University, Brown, ix 

of Chicago, iii, ix-xii, 6, 248, 320, 340, 364, 418 
Columbia, iii, vii, ix-xii 

(See also Statistical Research Group) 
of North Carolina, ix, xi, 112, 248 
Princeton, ix 

Stanford, ix, x, xi, 96, 384 
of Wisconsin, ix, 354 

Upper and lower limits (see Two-sided criterion; 
Two-sided tests; Two-sided tolerance limits) 

V 

Variables, inspection by, 7-93, 113-184, 227-234, 
269-296, 377-382, 385-393, 423, 426 
advantages of, 7-9 

amount of inspection in (see N ; N', Nm ; N»; Nt ) 
(See also Amount of inspection) 
conditions for advantageous use of, 9 
curtailment of, 14, 15 
disadvantages of, 8, 9 
for fraction defective, 7-93 

acceptance criteria in, 12, 14, 17, 33-41, 52- 
56, 67-71, 85 

(See also Computation, of acceptance region;
Formula, for acceptance criterion;
Graphical procedure)
advantages of, 7-9 

amount of inspection, 17, 21, 27, 28, 64-66, 
81, 82, 85 

(See also Amount of inspection; N\ JJ; 
Nmi N»; Nt) 

condition for advantageous use of, 9 
contrasted with attributes inspection, 7-9, 
26-28 

criteria, use of pair of one-sided in, 53, 54
figure showing, 55

criterion, use of single two-sided, 52-57, 63, 64


Variables, inspection by, for fraction defective,
criterion for changing to reduced or tightened
inspection in, 42
formulas for (see Formula)
graphical procedure in (see Graphical procedure)

when inspection is costly, 9 
when lot mean (μ) is known, 27n., 28n.,
230-234, 270, 271, 274, 275, 278-280
when lot standard deviation (σ) is known,
27, 28, 61, 62, 423 

operating characteristics of (see Operating
characteristics; Operating-characteristic 
curves; Operating-characteristic surface) 
plan of action, 14, 16-21, 33-41 
choice of, 45-49 
sequential, 83-85 
single sample, 7-82 

when upper and lower limits are imposed on 
a single quality characteristic, 52-57, 63, 64 
for fraction of ranges in samples of k that 
exceed specified limit, 227-234 
formulation in terms of σ, 229-231
multivariate, 113-184 

operating characteristics of (see Operating 
characteristics; Operating-characteristic 

curves; Operating-characteristic surface) 
for percent defective (see Variables, for fraction 
defective) 

sequential-sampling, 83-85, 232, 233, 423, 426 
single-sampling, 7-82, 113-184, 231, 232, 269- 
296, 377-382, 385-393 

use of several acceptance criteria in, 50-56, 63, 
64, 113, 114 

Variability, determining whether new method, 
process, or product exceeds a standard in 
respect to, 227-234, 270, 271, 274, 275, 278- 
280 

determining whether a particular one of two 
methods, processes, or products exceeds 
the other in respect to, 280, 282, 283, 296 
measures of (see Mean deviation; Range; 
Standard deviation; Variance) 

Variance, analysis of, 297, 310-314, 385, 386 
(See also Analysis of variance) 
of binomial distribution, 397, 410 
components of, 269, 297, 310-314 
due to grouping or rounding, 195, 218 
of correlation coefficient (r), 413 
definition of, 161, 195 

estimates of, 12, 13, 43, 162, 198 (See also s²)
χ²-test of, 231, 232, 270-280
F-tests of, 280-296, 311-314 
homogeneity of, Bartlett's test for, 183, 387, 
388 

largest of several, Cochran's test for significance
of, 383-394

experiments pertaining to, design and analysis 
of, 227-234, 269-318 
of grouped distribution, 195 
of interpolated value, 219-221 
of lot (σ²), as measure of lot quality (see Standard
deviation, of lot)

(See also Variance, of population)
Variance, of maximum likelihood estimator of
mean of normal population, when data are 
grouped or rounded, 197 
of mean, 162, 198 

when data are grouped or rounded, 197, 198 
of mean deviation, in normal samples, 162, 163,
169 

of median, in normal samples, 162 
of normal distribution, 161, 198 

(See also Standard deviation, of normal 
distribution) 

bias of s² as estimator of, when data are
grouped or rounded, 198-200, 203-214 
estimation of, from mean deviation, 162, 163, 
169 

from probit data, 342-352
from range, 227

from s², 12, 13, 43, 162, 198-200, 203-214
from sensitivity tests, 342-352
as parameter, 12, 161
tests of significance relating to, sequential,
232, 233, 426

single-sample, 231, 232, 269-296, 383-394
of Poisson distribution, 411
of population (σ²), bias of s² as estimator of,
when data are grouped or rounded, 198- 
200, 203-214 

confidence limits for, 198-200, 213, 214 
estimation of, from mean deviation, 162, 
163, 169 

from probit data, 342-352 
from range, 227 

from s², 12, 13, 43, 162, 198-200, 203-214
from sensitivity tests, 342-352
importance of distinguishing from s², 12
tests of significance relating to, sequential, 
232, 233, 426 

single-sample, 231, 232, 269-296, 383-394 
of proportion or percentage, 397, 410 
of r, 413 

of range (see Standard deviation, of range) 
ratio (see F) 

of rectangular distribution, 195, 218
of s², in samples from arbitrary, grouped, and
normal populations, 199
from normal population, formula, 199, 412
when data are grouped or rounded,
formula, 199, 211 

stabilizing, by transformation of data, general 
procedure for, 408-410 
applications of, 410-413 

(See also Inverse-sine transformation) 

W 

W or W_k: multiple of σ in terms of which cumulative
distribution of sample range in samples
of k (R_k) is given, 229, 230
w: width of grouping or rounding interval, 193
(see Grouping interval)

Wald, Abraham, ix, 52n., 83n., 99, 108n., 110,
143n., 147, 166n., 184, 419n., 429


Wallis, W. Allen, iii, x-xii, 6, 167n., 184, 248, 418,
419n. 

Weaver, C. L., 318n. 

Weaver, Warren, v, vii, x 

Weights, use of, in fitting line by least squares, 
343, 344 

when dependent variable is expressed in probits, 
343, 344, 350 

Welch, B. L., 8n., 22, 49, 58n., 64, 72-74, 76, 77,
79n., 86, 93, 99, 381, 382
quotation from paper by, with W. J. Jennett, 
49-50 

Wilenski, H., 46n., 93, 255n., 265
Wilks, Samuel S., 99, 110, 166n., 173, 174, 184
Wishart, John, 218, 219n., 220n., 221, 222
Wold, Herman, 19‘Wt., 196n., 223
Wolfowitz, Jacob, xi, 52n., 108n., 110, 330
Work sheets, for acceptance-region boundary,
using two-sided criterion, 57
for k, 77-79
for OC curve, 32, 80

WPA tables (see Mathematical Tables Project) 
Wright, Sewall, 398

X 

x: a measurement of a variable quality characteristic,
12

x̄: arithmetic mean of sample measurements (see
Sample mean)

x̄ + ks, 12, 14, 33, 36, 47n., 50, 53, 59, 61, 62, 64,
66, 71, 83 

(See also Acceptance criterion, in variables 
inspection for fraction defective; Toler¬ 
ance limits) 

bias of, as estimator of μ + kσ, in normal
samples, 58 

expected or mean value of, in normal samples, 
58 

standard deviation of, in normal samples, 58 
x̄ - ks, 13, 47n., 53
x̄ + Ks, 98, 99, 102

(See also Tolerance factor; Tolerance limits)
ξ: mean of a normal distribution, 164

Y 

Yardsticks, for differences of means, 150 
for T-statistics, 156 
for variance estimates, 156 
Yates, Frank, 18n., 93, 120n., 148n., 151n., 152n., 
183, 234, 255, 258n., 265, 315n., 317n., 318,
342n., 343n., 352, 398, 399n., 405, 407, 415
Yates’ correction of χ² for continuity, 254, 257

Z 

z: a normally distributed variable, 61, 73
z-transformation, of correlation coefficient (r), 
413 

of variance ratio (F), 317n. 

Zubin, Joseph, 257, 258, 265, 398, 400 
Zubin-Fulcher Item Analyzer, 257, 258, 405, 415 






